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Your pocket calculator can be an 
amazingly powerful instrument, and 
can even perform calculation? that 
are usually left to large computers. 
This is the first and only book that 
shows you how to get the most out 
of your pocket calculator for scien¬ 
tific and engineering applications— 
you’ll discover more uses than you 
ever thought possible! 

Ideal for all forms of analysis, this 
book gives you numerical tech¬ 
niques, approximations, tables, 
graphs, and flow charts for perform¬ 
ing quick, accurate calculations. The 
fast, efficient methods you’ll learn 
are “sized” for pocket calculator 
analysis and written in forms that 
minimize your workload (thus reduc¬ 
ing your chance for error). The book 
emphasizes methods for such spe¬ 
cific types of data processing as 
Fourier analysis, linear system simu¬ 
lation, gradient optimization, and 
statistical analysis, presented in 
forms that are directly applicable to 
the needs of engineers, scientists, 
and scientific programmers. 

The discussion of each numerical 
method includes: 

• a consistent and careful treatment 
of its use in numerical analysis 

• a detailed discussion of how each 
method is used in pocket calcula¬ 
tor work 

• a description of each method’s li¬ 
mitations and advantages (where 
applicable) 

• a tabulation of useful formulas in 
convenient forms for pocket calcu¬ 
lator analysis 

This book will show you how your 
pocket calculator can— 

• become a “Learning Machine” 

• help you learn new technologies 

• be used for your own research and 
discovery in the field of numerical 
analysis 
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PREFACE 


This book is written for all those who own or operate a modern electronic 
pocket or desk calculator, and especially engineers, scientists, science 
students, mathematicians, statisticians, physicists, chemists, computer ana¬ 
lysts, and science educators. 

When the right numerical methods are used, the electronic pocket 
calculator becomes a very powerful computing instrument. 
“Micronumerical methods” that will help the reader to derive the most 
computing capability for every dollar he has spent on his pocket calculator 
are discussed here. 

Most of the methods work on any pocket calculator. Special methods for 
certain types of machines are clearly indicated where necessary. Key stroke 
sequences for both algebraic and reverse-polish calculators are shown. 
Virtually all pocket calculator keyboards and capabilities were considered 
in preparing this book, to ensure that the numerical methods presented are 
the most universally applicable for general pocket calculator analysis. 

Each part of this book provides a consistent and careful treatment of the 
methods and tabulated formulas that can be used with a pocket calculator. 
The aim is to supply the reader with a large number of numerical 
techniques, numerical approximations, tables, useful graphs, and flow 
charts for performing quick and accurate calculations with pocket calcula¬ 
tors. 

The numerical methods are presented from four viewpoints: 

1. The numerical evaluation aspect of each numerical method. 

2. The manner in which each method is used. 

3. The limitations and advantages of each method. 

4. The tabulation of useful formulas in forms that are most convenient 
for pocket calculator analysis. 
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Preface 


Emphasis is also given to numerical methods used in certain types of data 
processing, such as harmonic and statistical analysis. And they are pre¬ 
sented in forms that are directly useful to engineers, scientists, and pro¬ 
grammers. 

The premise of this book is that the pocket calculator provides the 
scientific analyst with an important new dimension in analysis. Obviously 
the pocket calculator is useful both for numerically evaluating functions 
and for processing data. In addition , it enables the analyst to quickly gain 
detailed and quantified knowledge about any technical discipline (his own 
or another’s) by learning its mathematical models and tools through use 
and experimentation on the pocket calculator. In short, the pocket cal¬ 
culator becomes a learning machine for the scientific analyst. A scientific 
analyst no longer need first develop a mathematical model for a complex 
process or system being studied and then turn it over to a computer 
programmer for its numerical evaluation. Instead, he can numerically 
evaluate complex functions (and thus analyze complex problems) on even 
the simplest four-function calculators in his home or office. 

Finally, throughout the world scientific analysts working on pocket 
calculators are inventing their own numerical methods for evaluating 
problems in their specific disciplines. In this sense, the pocket calculator is 
a research tool which the analyst can use to develop his own numerical 
methods for his own purposes. 

Throughout the book I give more attention to subjects of interest to the 
practitioner than to those of interest to the theorist. Though the treatment 
of this material is mathematical, I have not strived for conciseness or rigor 
beyond that required for pocket calculator analysis. Numerous examples 
of each technique and method are given, and their implementation is 
discussed in detail. 

This book consists of four parts that are subdivided into 12 chapters, 
each dealing with topics in numerical analysis that are useful to the 
practical analyst. I have tried to avoid overgeneralization in the treatment 
of these topics, since numerical analysis is an art as much as it is a science. 
Part / of the book introduces the spectrum of pocket calculators (including 
their capabilities and their limitations) available to engineers and scientists. 
Particular attention is given to the unique computing features of interest to 
the scientific analyst. Part I also presents mathematical preliminaries and 
mathematical refresher material and develops certain elementary numeri¬ 
cal methods particularly suited to analysis on the pocket calculator. Topics 
from arithmetic to algebra and analysis with complex variables are co¬ 
vered. 

Part II presents numerical methods and formulas for numerically 
evaluating advanced mathematical functions. It also deals with the nested 
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parenthetical form of the most frequently used functions in advanced 
engineering mathematics. It is the nesting of a sequence of arithmetic 
operations in parenthetical form that is the basis for performing advanced 
analysis on the pocket calculator. For example, 14 multiplies, 2 divides, 2 
sums, and 108 data entries, totaling 126 key strokes and 5 data storage 
records, are needed for a three-digit floating-point evaluation of sin (x) 
^x — x 3 /3! + x 5 /5!. But only 54 key strokes and no data storage records 
(on a scratch pad) are needed to evaluate sin (x)«x(l — (x 2 /6)(l — x 2 /20)) 
to the same accuracy. Though we would evaluate sin (x) in this manner 
only on a four-function calculator, this example does illustrate the point 
that many complex formulas usually requiring calculator memory to be 
numerically evaluated can be written in a “nested” form not requiring 
calculator memory and thus can be evaluated conveniently on even the 
simplest four-function pocket calculator. 

The nested parenthetical form is considered to be a “fast” form for 
numerical evaluation. That is, functions written in nested parenthetical 
forms require fewer operations to numerically evaluate than do the same 
functions in their “simplest algebraic form.” The nested forms are there¬ 
fore evaluated more rapidly than their unnested counterparts. 

In Part II the topics of recurrence formulas for evaluating advanced 
functions and performing analysis are also covered. Recurrence formulas 
are unique in that they are infinite memory forms of otherwise finite memory 
form calculations. The formulas give the pocket calculator “virtually” an 
infinite memory for storing data, which creates many numerical methods 
for data processing (as in statistics) to be rewritten in recursive form for 
pocket calculator analysis. Here, again, the emphasis is on making even the 
simplest four-function calculator capable of doing sophisticated analysis 
without memory. Such concepts as nested parenthetical forms and recur¬ 
sion formulas, when combined with those of Chebyshev economization 
and rational polynomial approximation, provide tremendous flexibility and 
accuracy in the numerical evaluation of even the most complex functions 
on the simplest four-function pocket calculator. In fact, the serious analyst 
can perform precision calculations unheard of until a few years ago—in 
the comfort and convenience of his home or while traveling on the job. 
Part III examines the methods and formulas for performing advanced 
“types” of analysis on the pocket calculator. Included are such topics as 
numerical evaluation of definite integrals and methods for numerical 
differentiation of data sets, solving differential equations, simulating linear 
processes, performing statistical analysis, and performing harmonic analy¬ 
sis. 

Part IV deals exclusively with analysis on the advanced programmable 
pocket calculator. The chapters illustrate conclusively the leap in comput- 
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ing capability produced by the pocket calculator. They are based on 
personal experience in solving a very large number of problems on the 
programmable pocket calculator developed by the Hewlett-Packard 
Corporation. The discussion is general, however, recognizing that more 
programmable pocket-style calculators are being developed. 

This book grew out of eight years of study on numerical methods for 
analysis on the digital computer. These methods were revised over a period 
of three years to make them applicable to desk calculator analysis and 
eventually to pocket calculator analysis. A number of the methods have 
been available to the analyst in scattered literature, such as user’s guides 
and manuals for desk-top and pocket calculators, journal articles, and 
some textbooks. A large part of the material was developed by the author or 
was provided by my associates in industry. I am particularly indebted to my 
associates, at the Software Research Corporation and the McDonnell- 
Douglas Corporation. They generously shared with me many of their 
“tricks of the trade” and suggested interesting problems for this book. I 
express my sincere appreciation to one of the great numerical analysts of 
our time, Dr. Richard Hamming, of Bell Laboratories, for his review and 
improvements to the manuscript. 

My thanks to the people at Hewlett-Packard who reviewed and critiqued 
the manuscript, and in particular to the HP-65 chief engineer, Mr. Chung 
Tung. 

I want to thank Joseph and Sara Goldstein, who taught me the 
Goldstein algorithm—“one at a time.” 

To my wife, Laurie, my special appreciation for putting up with the 4 
a.m. Writing schedule. 

Finally, my thanks go to Mrs. Florence Piaget who typed the manuscript 
and helped me to prepare it for publication. 


Jon M. Smith 


St. Louis, Missouri 
August 1974 
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CHAPTER 1 


THE POCKET CALCULATOR 


1.1 INTRODUCTION 

This chapter discusses the mathematical differences among the various 
pocket calculators and certain mathematical concepts, useful for analysis 
on the pocket calculator, that appear throughout this book. 

We are not so much concerned with the hardware implementation of 
mathematical operands and operations as with the different ways in which 
they can be assembled in a computing machine—the hardware architec¬ 
ture. Only the most obvious mathematical aspects of calculator design are 
examined, such as the language used, the size and type of memory, the 
instruction set, type of input/output, and whether the calculator is pro¬ 
grammable. There are some 432 types of calculators that could be hard¬ 
ware implemented. An entire book could be written on this subject alone. 
Here we limit our discussion to the more important mathematical 
differences that result from the various hardware implementations in order 
to: 


1. Understand pocket calculators and the organization of mathematics 
within them. 

2. Determine, in a cursory way, the combinations of hardware imple¬ 
mentation that result in a significant jump in calculating capability. 

The purpose is to narrow the types of calculator to be considered in this 
book to three. 

Three hypothetical calculators that are typical of the available and 
anticipated pocket computing machines are discussed. Care has been taken 
throughout not to limit the methods of analysis to any particular hardware 
implementation. In fact, if there is bias throughout the writing it is in the 
direction of anticipated developments in the pocket calculator field, though 
its overall effect on the material is negligible. 
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The Pocket Calculator 


The following mathematical aspects are covered in this chapter: 

1. Arithmetic calculations. 

2. Function evaluation with and without memory. 

3. Computational accurac}^ 

The first is a thorough introduction to what appears to be a mundane 
subject (arithmetic on the pocket calculator). In fact, it is found to be quite 
the opposite because the different languages used by different calculators 
lead to significantly different capability for handling complex problems. 

Particular attention is given to nested parenthetical forms of functions 
that permit function evaluation on memoryless and limited-memory cal¬ 
culators. Nested parenthetical forms are used as a means of providing 
implicit memory to the memoryless calculators. They are also “fast” in the 
sense that their evaluation involves fewer key strokes than the usual 
algebraic form. 

No chapter on mathematical preliminaries in a book on numerical 
analysis would be complete without a discussion of computational ac¬ 
curacy. Here we examine: 

1. The accuracy limitations of the typical pocket calculator. 

2. Ways in which to accurately evaluate functions in general, and on the 
pocket calculator in particular. 

1.2 MATHEMATICAL DIFFERENCES IN POCKET CALCULATORS 

Today’s pocket calculators differ mathematically in many ways. Only the 
six more commonly encountered mathematical distinctions are covered 
here. In a sense, these are the major distinctions because they are the 
fundamental issues addressed in the conceptual design of every pocket 
calculator. The important mathematical distinctions that are associated 
with the subtleties of detailed design are not discussed because the hard¬ 
ware implementations vary widely. Perhaps the best known difference is 
that between the use of fixed-point and floating-point numbers. 

The fixed-point numbers are those whose decimal point is fixed by the 
electronic circuitry. The difficulty is that when multiplying two large 
numbers together, so that the most significant digit exceeds the size of the 
numeric display, the number is truncated not in the least significant digits 
but in the most significant digits. Most fixed-point arithmetic computers 
have a symbol that is illuminated to indicate the overflow condition. 

Floating-point numbers have a decimal point that moves so as to retain 
the most significant digits in any calculation. When a number is computed 
that is larger than the calculator’s field of numbers and the decimal point 
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location is unknown, most calculators display the most significant digits 
and illuminate a symbol indicating that the decimal point location is 
unknown. 

In these two number systems, it should be noted, the number fields are 
dramatically different. In the floating-point number system the numbers 
are “bunched” around zero. In the fixed-point number system the numbers 
are uniformly distributed over the range of the number field. To see this, 
consider the process of incrementing each of these types of numbers on a 
pocket calculator. 

The smallest possible increment between any two numbers is the least 
significant digit in the numeric display. For an eight-digit display with a 
decimal point fixed in the third place, the smallest increment that can be 
added to any digit is 0.001. Now consider the addition of an increment to a 
floating-point number. Since the decimal point “floats” in the floating¬ 
point number system, the decimal point precedes the far-left digit. For an 
eight-digit display, the smallest number that can be added to zero in a 
floating-point number system is 0.00000001. Now consider incrementing a 
floating-point number when the decimal point is after the far-right digit. In 
this case, the smallest number that can be added to 99999998 is 1. The 
difference in the size of the “smallest number” when incrementing a full 
and empty register in floating-point numbers is a factor of 10 8 . 

Now consider the full range of the positive numbers in both number 
systems. The fixed-point numbers range from 0.001 to 99999.999. The 
difference between numbers, no matter where a number is over the range 
of the calculator, is 0.001. Thus the numbers over the range of fixed-point 
numbers are uniformly distributed. 

Again consider the range of the positive floating-point numbers, from 
99999999 to 0.00000001. Clearly, the range is greater in the floating-point 
number system than in the fixed-point number system, but note also that 
when the numbers are very small the distance between them is 0.00000001. 
When the register is full, the difference between the numbers is 1. 
Obviously, over the range of floating-point numbers, the distribution is not 
uniform. In fact, there are as many numbers grouped between 0 and 1 as 
there are between 1 and the full register size 99999999. 

It follows, then, that in fixed-point arithmetic the absolute difference 
remains fixed over the entire range of the number system, while in 
floating-point numbers the absolute difference varies significantly. It is 
worth emphasizing that in floating-point arithmetic the percentage 
difference remains fixed, while in the fixed-point arithmetic system the 
constant difference remains fixed over the range of numbers. As used here, 
percentage difference is the ratio of the difference between two consecutive 
numbers divided by the larger of the two. For most engineering analysis, 
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percentage difference and percentage error are usually the measure of 
accuracy of most interest. 

The floating-point number system is usually extended by powers of 10, 
permitting the positive floating-point numbers to range from 10 -99 to 
99999999 X 10". In fact, calculators are usually configured to display this 
extended number field in scientific notation. Interestingly, this even further 
bunches the floating-point numbers in the neighborhood of zero. Because 
of this grouping property of the floating-point numbers, the absolute errors 
are smaller for calculations with numbers between 0 and 1 than for 
numbers betwen 1 and the full range of the calculator. 

From a hardware architecture viewpoint, fixed-point numbers are usu¬ 
ally displayed with greater accuracy than floating-point numbers; and 
floating-point numbers are usually displayed with a greater dynamic range 
than fixed-point numbers. This can be seen by considering a register with 
eight display elements where we configure both fixed-and floating-point 
numbers. In fixed-point arithmetic, eight mantissa digits can be displayed. 
If the decimal point is allowed to be set by the decimal point key Q] , 
and a display element is used to show the decimal point, then only 7 digits 
remain to display the mantissa. If scientific notation is used to increase the 
dynamic range of the display, m +1 display elements are required to 
display m digits in the exponent. The extra display element is used to show 
the sign of the exponent. 


Required 

Display 

Power of 10 Display Elements 


\0 ±x (±) 0 ) 2 

10 ±x * (± )(*)(*) 3 


j Q ± XXX 


(±)WW^M 

m digits 


m + 1 


If 99 orders of magnitude are to be shown in the display register (the 
usual case with scientific pocket calculators) three display elements are 
required to display the exponent and its sign, leaving only five digits for 
displaying the mantissa. In this sense, then, the effect of increasing the 
display’s dynamic range is to reduce the number of digits for displaying a 
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mantissa, thus reducing the accuracy with which a number can be dis¬ 
played. 


1.3 INSTRUCTION AND DATA ENTRY METHODS 

We discuss three types of data entry methods (languages) commonly used 
in pocket calculators: polish, reverse-polish, and algebraic. In polish nota¬ 
tion, the operator precedes the operand. For example, to instruct the 
calculator to add the numbers A and B, in the polish entry method we 
would stroke the plus key, then enter the two numbers A and B. The 
logical operation in the machine would then display the result without the 
need for striking an additional key. In reverse-polish, the process is 
reversed; that is, the operands are introduced before the operator. In 
algebraic notation, the operator is sandwiched between the two operands. 
If we compute the sum of A and B in algebraic notation, we first input A , 
then stroke the summation key, follow that with an input of B, followed by 
stroking the equal key, whereupon C would be displayed in the register. It 
might seem that one entry method would result in many fewer key strokes 
than another entry method when numerically evaluating a function, but it 
turns out that the key strokes associated with instructing the calculator are 
fairly small compared with those associated with data entry. Far more 
important is the fact that certain entry methods, when combined with 
memory, result in the need for fewer data inputs or “scratch-pad” storage. 
The most common entry methods used in pocket calculators are the 
reverse-polish and algebraic methods, the former usually being used with 
machines that have a memory stack and the latter being attractive because 
of its “natural” algebraic treatment of numerically evaluating algebraic 
functions. 

The natural way in which the algebraic method is used to numerically 
evaluate algebraic functions can be seen in the following example. Con¬ 
sider the relation 

AxB+C=Y 

When evaluated on an algebraic notation pocket calculator (such as the 
Texas Instruments SR-10), the sequence of key strokes is* 

CL A XB+C= xxxxxxxx xx 

The same function evaluated on a reverse-polish notation calculator (such 

*Here the symbols CL, f, X, +, and = mean, respectively, “to clear the display register,” “to 
store what is in the register in a temporary location,” “to multiply,” “to add,” and “to present 
the answer—Algebraic language only.” 
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as the Hewlett-Packard-35) would involve the sequence of key strokes 
CL A'l B X C + xxxxxx xx 

It is apparent that the former is more natural for simple functions than the 
latter. Reverse-polish notation, when used in conjunction with memory 
stacks, has the convenient property that it easily implements the numerical 
evaluation of functions with parenthetical expressions. This is not the case 
with algebraic notation. For example, the sum of products 

(AxB) + {CxD) 

must be rewritten in the form 


( Ax B 

l D 


+ C 


D 


to be evaluated using algebraic notation without using a scratch pad. The 
key strokes and operation to evaluate the sum of products directly is 

CL Ax B = xxxx xx STORE ON SCRATCHPAD-] 

CL C X D = yyyy yy -I- INPUT xxxx xx = zzzz zz 

The key strokes to evaluate this sum of products in the rewritten form is 

CL A xB+D + CxD = zzzz zz 

The reverse-polish with stacks evaluates the sum of products conveniently 
with key strokes 


CL A\B X C'lD X + zzzz zz 

To avoid rewriting expressions in somewhat unfamiliar forms, the 
algebraic programming language can be designed to recognize a hierarchy 
among the operators, that is, when products are computed before sums or 
vice versa. The algebraic method with a “product-before-sum” hierarchy 
(such as the Texas Instruments SR-50) would evaluate the sum of products 
directly with the following key strokes: 

CL A xB+CxD = zzzz zz 

It has difficulty, however, with the expressions of the product of sums 

(A + B)X(C+D) 

in that the hierarchy is set up to “multiply-before-add” rather than 
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“add-before-multiply” which the product of sums requires. This problem is 
resolved with an additional storage location in which to store the inter¬ 
mediate sum. The key strokes are then., 

CL A + B = STO CL C+DX RCL = zzzz zz 
In reverse-polish with stacks, the key strokes are 

CL A]B+ C^D-^t X zzzz zz 

Here STO means “store in memory” and RCL means “recall from 
memory.” 

1.4 MEMORIES 

Pocket calculators are available with no memory, memory for a constant 
term, a memory stack of three to four registers, and in the more sophisti¬ 
cated machines addressable memory registers as well as the stack. The 
pocket calculator with a memory that simply retains a constant is 
characterized by the rather inflexible storage of a constant number that 
can be recalled or not recalled to the display register, as the operator 
desires. The stored constant can be used as a coefficient in multiple 
products or as a constant in multiple sums. The constant memory register 
does not automatically interact with the display register in most pocket 
calculators. 

Pocket calculators with memory stacks generally involve three or four 
registers that can be manually “pushed up ” and automatically “pushed 
down ” for the purpose of retaining numbers developed in the display 
register. When used in conjunction with reverse-polish notation, they 
provide the first quantum level of computing capability above that present 
in the simple four-function memoryless pocket calculator. Data are usually 
entered into a stack with an entry operation. The three stack registers of a 
Reverse-Polish machine can be filled with three different numbers and 
then, as the operation on the number in the display register and the 
bottommost number in the stack is called for, the result is displayed in the 
display register and the stack automatically moves down, bringing the 
second number in the stack now to the first number, and the third number 
to the second. This process can be continued until the stack is empty. The 
algebraic machine with hierarchy uses the stacks somewhat differently. 
When a key stroke sequence is to be evaluated (upon key stroking the 
equal sign), the calculator first looks for products to evaluate and put into 
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the memory stack and then executes sums (in a “multiply-before-add” 
hierarchy). The stack manipulations are automatic. 

In the more sophisticated calculators with addressable registers, the 
process of storing data in a register is similar to that of storing data in the 
memory of a computer. The addressable stack does not interact unless 
programmed to do so. This memory resembles the register for storing a 
constant in the simpler calculators. Here, however, the ability to address 
the register is involved. 


1.5 INSTRUCTION SET 

The basic “four-function” calculator has keys for instructing the calculator 
to add, substract, multiply, and divide. What is amazing is that these small 
four-function machines, purchased at relatively low cost, can provide 
tremendous computing power. Examples of the use of the four-function 
pocket calculator for evaluating some of the most sophisticated engineer¬ 
ing analysis will be seen later. Another arithmetic operation that can be 
performed with the four-function machine is computing powers of a given 
variable through repetitive multiply operations. While squaring a number 
involves only two multiplies, the number must be double entered. Thus, 
the simplest additional instruction that can be added to a pocket calculator 
that reduces the number of key strokes is the squaring operation or 
modifying the multiply instruction to square a number when only one data 
entry has been made. 

Entirely new capabilities are added when the square root and reciprocal 
instructions are added to the calculator instruction set. There is no single 
stroke way on a four-function calculator to numerically invert a number 
without using a scratch pad and double data entry.* A similar situation 
holds for the square root. Thus we find the next most sophisticated pocket 
calculator to be a seven-function calculator, including square, square root, 
and reciprocal functions implementable with a single key stroke. Beyond 
this, additional instructions are added to aid in special-purpose computing 
in a variety of ways. The underlying theme throughout the addition of 
functions to a pocket calculator keyboard is to reduce the number of key 
strokes associated with data inputs. 

Because we will be continually referring to instructions found on most 
scientific calculators, let us define the instruction sets that we use in this 
book: 


See Appendix A1-5. 



Key Symbol 

Key Name 

CL 

Clear 

30-0 

Digit 

□ 

Decimal point 

EE 

Enter exponent 

CHS 

Change sign 



Add 

Subtract 

Multiply 

Divide 

Square 



Square root 


Key Instruction 

Clears information in the cal¬ 
culator and display and sets 
the calculator at zero 

Enter numbers 0 through 9 to 
a limit of an eight-digit man¬ 
tissa and a two-digit exponent 

Enters a decimal point 

Instructs the calculator that 
the subsequent number is to 
be entered as an exponent of 
10 

Instructs the calculator to 
change the sign of the 
mantissa or exponent appear¬ 
ing in the display 

Instructs the calculator to add 

Instructs the calculator to sub¬ 
tract 

Instructs the calculator to 
multiply 

Instructs the calculator to di¬ 
vide 

Instructs the calculator to find 
the square of the number dis¬ 
played 

Instructs the calculator to find 
the square root of the number 
displayed 

ll 





Key Symbol 


Key Name 


Key Instruction 



Reciprocal Instructs the calculator to find 

the reciprocal of the number 
displayed 

Sine Instructs the calculator to de¬ 

termine the sine of the dis¬ 
played angle 

Cosine Instructs the calculator to de¬ 

termine the cosine of the dis¬ 
played angle 

Tangent Instructs the calculator to de¬ 

termine the tangent of the dis¬ 
played angle 

Inverse trigonometric Instructs the calculator to de¬ 
termine the angle of the se¬ 
lected trig function whose 
value is the displayed quantity, 
when pressed as a prefix to the 
sin, cos, or tan key 

Common logarithm Instructs the calculator to de¬ 
termine the logarithm to the 
base 10 of the displayed 
number 

Natural logarithm Instructs the calculator to de¬ 

termine the logarithm to the 
base e of the displayed 
number 

e to the x power Instructs the calculator to raise 

the value of e to the displayed 
power 

y to the x power Instructs the calculator to raise 

y, the first entered number, to 
the power of x, the second en¬ 
tered number 


12 








Key Symbol 


Key Name 


Key Instruction 


xV~ y 


STO 


RCL 





CL* 



The *th root Instructs the calculator to 

of y process y, the first entered 

number, to find the *th root. 
The value of * is the second 
entered number 

Store Instructs the calculator to 

store the displayed number in 
the memory 

Recall Instructs the calculator to re¬ 

trieve stored data from the 
memory 

Sum and store Instructs the calculator to al¬ 

gebraically add the displayed 
number to the number in the 
memory, and to store the sum 
in the memory 

Instructs the calculator to find 
the factorial of the number 
displayed 

(Algebraic entry method only) 
Instructs the calculator to 
complete the previously en¬ 
tered operation to provide the 
desired calculation result 

Clears the last keyboard entry. 


Factorial 


Equals 


Clear entry 


Pi 


Enters the value of pi (7 7) in 
the display register 


ENTER | 


Enter 


Loads contents of * register 
intoy register and retains con¬ 
tents of x register in * register 
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1.6 THE PROGRAMMABLE POCKET CALCULATOR 

The most sophisticated pocket calculators available at present are the 
HP-55 and HP-65. They operate in Reverse-Polish notation, have memory 
stacks and registers, uses floating-point arithmetic with scientific notation, 
has an extensive three-level function set, and are programmable. From the 
standpoint that the programmable pocket calculator implements logical 
(Boolean) equations as well as Algebraic equations, can make logical 
decisions, and will iteratively execute a preprogrammed set of instructions, 
it can be correctly called a pocket computer. It is called a calculator only 
because it does not satisfy the U.S. Government’s import/export trade 
definition of a computer. Because it is generally accepted that the defini¬ 
tion of a computer (or calculator) changes as the state of the art of 
computer design changes, it is also acknowledged that in 1955 the pro¬ 
grammable pocket calculator would have been called a computer. 

Programmable calculators provide a quantum jump in pocket computing 
capability by making libraries of program listings and prerecorded mag¬ 
netic tape programs available to the analyst at relatively low cost. These 
libraries can be compiled by the user himself or can be purchased. 


1.7 THE CALCULATORS TO BE DISCUSSED IN THIS BOOK 

We have seen that there are three types of entry method, three types of 
memory, and three kinds of number that can be implemented in any of the 
three kinds of pocket calculator with (though not described here) four 
types of function set and two types of I/O —the hard copy and the manual 
I/O. Hence at least 432 types of calculator could be made up from 
different combinations of these electronic hardware alternatives. While the 
number of reasonable combinations is somewhat smaller, about 50, the 
number of possible types of pocket calculator is still too large to be 
covered in one book. We therefore analyze only three basic types of 
hypothetical pocket calculators. One is a simple four-function calculator. 
The other is an engineering machine, again a hypothetical one, but with a 
function set characteristic of the SR 50/51 or HP-21/35/45 series 
machines. The third is the programmable pocket calculator, which we 
assume to have a four-register stack with a nine-register addressable 
storage and a 100-word instruction set. This, too, is a hypothetical machine 
whose properties are defined in the context of the discussion. 
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Of the three hypothetical pocket calculators, emphasis is placed on the 
second—the engineering-type four-register stack machine with the usual 
complement of engineering functions. Also, because the simple four- 
function machine is now available at very little cost, attention is given 
throughout the book to performing advanced analysis on this machine. 
What continues to amaze the writer is the extent to which the four- 
function pocket calculator can be applied to engineering analysis once the 
equations to be solved are manipulated in forms that require no memory 
for their evaluation. 

For all these machines, we assume that we are limited at most to a 
10-digit register and we use floating-point arithmetic with scientific nota¬ 
tion. 

The keyboard for the hypothetical four-function calculator to be dis¬ 
cussed is sketched in Figure 1-1. The keyboard functions for the scientific 
and programmable pocket calculator are shown in Figure 1-2. The basis 
for the discussions dealing with this calculator is the HP-65, in that it is 
representative of what will be available in the foreseeable future. 



Figure 1-1 A hypothetical four-function pocket calculator keyboard. 
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Figure 1-2 A hypothetical scientific pocket calculator keyboard (mixed Algebraic and 
Reverse-Polish and programmable functions). 


The display details for all calculators discussed here are shown in Figure 
1-3. The display features that we will discuss from time to time include the 
following: 

Decimal point Assumed to be to the right of any number 

entered unless positioned in another sequence 
with the Q] key. 
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Sign 


Mantissa 


Exponent 


x x x x x x x x x (-} x x 


/ S- / 

Sign 


Integer decimal point Decimal 

Figure 1-3 Typical pocket calculator display format. 


Minus sign 


Overflow indication 


Underflow indication 


Appears to the left of the 10-digit mantissa for 
negative numbers, and appears to the left of 
the exponent for negative exponents 
In most pocket calculators, the largest number 
that can be entered in the calculator is ±9. 
999999999x 10" without an overflow when a 
function is pressed. If a calculation result is 
larger than this value, the display will flash or 
give a numerical indication of overflow 
If a number closer to zero than to ±1.0X 
10“ 99 is entered in the calculator, the display 
will flash or indicate an underflow. 


While concentrating on the hypthetical machines just mentioned, we 
shall comment on machines with slightly different keyboards where 
appropriate. 


1.8 ARITHMETIC CALCULATIONS AND LANGUAGES 

It might seem that the arithmetic functions of addition, subtraction, 
multiplication, and division are so basic to the pocket calculator that very 
little need be said about them. It is because they are so basic that they are 
discussed in some detail here. Arithmetic performed in one language is 
substantially different from that performed in another language. In one 
language certain arithmetic calculations are quite convenient and easy to 
remember to the infrequent user. Another language, though less con¬ 
venient to the beginner, is more powerful and flexible (hence more con¬ 
venient) to the frequent user. Finally, and perhaps most important, mixed 
arithmetic calculations illuminate the need for memory—whether manual, 
(using a scratch pad) or temporary data storage, (using automatic stacked 
registers) or permanent data storage, (using addressable memory). 




Table 1-1 Arithmetic in Algebraic and Reverse-Polish Languages 


Key Stroke Sequence 

Task Algebraic Reverse-Polish 


Sum A&B 

Sum A&B&C 

Sum A&B&C&D 

Multiply A&B 
Multiply A&B&C 


A + B = 

A+B+C= 

A+B+C+D= 

AX B — 

Ax B XC = 


A\B + 


I A'[B+C + 

\ A\B'[C+ + 

! A'{B+C + D + 
A\B'\C+ +D + 
A'[B'[C+D+ + 
A\B'[C'[D+ + + 


A'lBX 

( A\B X C X 

( ^t^T^xx 


Multiply A&B&C&D AxBxCxD = 


AfB XCXDX 
AfBfC X X D X 
AfBfC X D X X 
AfBfCTD XXX 


Compute 
(AXB) + (CXD) 


AXB+D+CXD 
= (no memory) 

AxB + CxD 

= (with hierarchy) 
AX B STO C X D RCL + 
= (with memory) 


A\B X CfD X + 
A'lB^C^D XR|XRf + * fl 


Compute 

(v4 + Z?)X (C + D) 


AX B STO C + D X RCL = 


A^B + C^D^- X 
^T^tCt^ + Ri + R|X* 


a *See page 21 for a definition of RJ, and R|. 
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The two most popular languages used in pocket calculators are algebraic 
and reverse-polish. The languages were introduced in Section 1.3. Table 
1-1 illustrates the key strokes involved in performing additions, multiplica¬ 
tions, and mixed arithmetic calculations such as products-of-sums and 
sums-of-products using both the algebraic language and the reverse-polish 
language. A number of insights on analysis on the various pocket calcula¬ 
tors can be derived by examining the table. The most obvious is that the 
algebraic language programs the calculation of simple series arithmetic 
calculations in exactly the manner in which we would write them as an 
algebraic expression reduced to its simplest form. It is equally obvious that 
even simple series arithmetic calculations can be performed in a number of 
different ways when using reverse-polish language (except for the simplest 
operations of adding and multiplying two numbers). In a sense, then, for 
these simple arithmetic tasks, the algebraic language has one unique 
sequence of key strokes for performing the task, while the reverse-polish 
does not. When viewed from the algebraic language enthusiast’s 
standpoint, this ambiguity in ways to solve simple series arithmetic prob¬ 
lems in reverse-polish is viewed as a possible confusion factor for the 
pocket calculator user. The reverse-polish language enthusiast, however, 
views the same property as a measure of the flexibility of the reverse-polish 
notation. From his viewpoint, the user has greater flexibility in the 
algebraic forms in which an arithmetic problem can be presented for 
numerical evaluation. Furthermore, he could argue, the first form shown in 
each of the series calculations in Table 1-1 is close to the algebraic 
language key strokes, differing only in the second and last key strokes. 

It is interesting that this distinction should come up at all, since the 
mixed arithmetic in the last two examples in Table 1-1 shows the many 
different ways in which the sum of products can be evaluated with the 
algebraic and reverse-polish languages. Note that the first example of the 
use of algebraic language to evaluate the sum of products illustrates the 
rewriting of the algebraic form as 

(A X B) + {C x + C^jD 

We see from the sequence of key strokes that the sum of products can be 
evaluated without memory. This form of evaluating the sum of products is 
ideal for use on the simple four-function calculators in that it requires no 
scratch pad memory and is within the set of operations available on even 
the simplest pocket calculator. A similar expression can be developed for 
calculating the product of sums without need of memory. Again, the 
algebraic form of the equation must be rewritten to be convenient for 
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calculator evaluation as 

((A + B)XC \ 

(A + B)x(C+D) = -- - - +A + B \D 

The importance of rewriting expressions in forms that are easily 
evaluated on the pocket calculator is obvious, however. The example of the 
sum of products (the second from the last in Table 1-1) shows that the 
most convenient form for implementation on any pocket calculator may 
depend on the language that that calculator uses and the sophistication 
with which it is implemented. For example, the second sequence of key 
strokes to evaluate the sum of products is in the standard algebraic form. 
This form works well for sums of products where the algebraic language is 
implemented with a hierarchy of operands, that is, the multiplies are 
performed before the sums. Also, the third example of the employment of 
algebraic language for evaluating the sum of products shows the standard 
algebraic forms for evaluating the sum of products on a machine that uses 
the algebraic language but has an additional register for memory. 

We observe in Table 1-1 also that no memory is required for performing 
simple arithmetic calculations in algebraic language until we reach the 
product of sums, the last example in the table. Such is not the case for the 
reverse-polish language. For example, only two registers are required for 
implementing the simple sum A + B + C+D in algebraic language. In 
reverse-polish language, only two registers are required to implement the 
sum as shown by the first example in the column of possible implementa¬ 
tions of this series of sums. The other three possible implementations 
require additional registers in which to store the data A , B , C, and D. 
Clearly in algebraic language additional registers would not permit al¬ 
ternative ways to evaluate the sum, while in reverse-polish every additional 
register leads to one additional way. In the example shown in Table 1-1 it 
is assumed that there are four registers in which to store the four data A, B, 
C, and D. Obviously, the use of reverse-polish language with stacks of data 
registers adds flexibility to a pocket calculator. In a sense, then, polish 
notation and stacks go together in a pocket calculator. It is also apparent 
that algebraic language eliminates the need for extensive stacks of data 
registers, since no additional flexibility is permitted with the addition of 
register stacks. Therefore, most calculators that use algebraic language 
have smaller memories than pocket calculators using reverse-polish. 

Another observation that we can make from Table 1-1 is that machines 
with algebraic notation which also have hierarchy and an additional 
register of memory (such as the SR-50) embody the highest level of 
capability available for pocket calculators using the algebraic language. 
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Such algebraic machines compete effectively in conducting mixed arith¬ 
metic calculations with the reverse-polish language machine, such as the 
HP35/45 series, with somewhat less electronic complexity. However, the 
reverse-polish with stacks adds operational flexibility for the user, which 
the algebraic machine does not. Moreover, the algebraic machine requires 
that the form of the equation be evaluated, particularly if it is highly 
complex. The reverse-polish language, on the other hand, provides the 
flexibility to evaluate very complex expressions with minimum attention 
being paid to the arrangement of terms. This flexibility is in part due to 
additional arithmetic registers that the typical reverse-polish machines 
generally have. 

Because the manipulation of data among the data registers is essential to 
understanding both the reverse-polish with stacks machine and the ad¬ 
vanced algebraic machines, we discuss memory manipulations next. 

When we speak of a reverse-polish machine with stacks, we assume that 
a stack consists of four registers for storing numbers. Following Hewlett- 
Packard notation, we call these registers X, Y, Z, and T. Register X is at 
the bottom of the stack, T is at the top of the stack, and the display always 
shows the number in the X register. We designate the number in the 
register by the same letter in italic type. Thus X, Y, Z, and T are the 
contents of registers X, Y, Z, and T. When a number key is stroked, the 
number enters the X register which is displayed. The number is repeated in 
the Y register when the “enter” key in is stroked.Whatever is in the Y 
register is “pushed up” into the Z register. The contents of the Z register 
are moved into the T register, and the contents of the T register are lost 
(see Figure 1-4). As data are entered into the Y register from the X register, 
the data in the other registers are “pushed up” automatically with the only 
data lost being the data in the T register. Data in the Y register can be 
viewed in the display by rolling the data from the Y register down to the X 
register by stroking the “roll-down” key | Rj | . The data in the X register 
are then worked backwards in the stack to move to the top register (T), the 
data in the top register move to the Z register, the data in the Z register 



Lost 

T 

Z 

Y 


X Figure 1-4 Data flow associated with data entry. 
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move to the Y register, and, as mentioned before, the data in the Y register 
move into the X register where they are displayed. Stroking the “roll- 
down” key again causes the data that were formerly in the Z register, 
which have been moved to the Y register, to move down to the X register 
where they can be seen in the display. All other data are moved to a 
neighboring register in the direction in which the roll is made. It follows 
that after four “roll-down” key strokes the stack will be arranged back in 
the original order where X is in its original location and is displayed in the 
X register, Y is in its original location, Z is in its original location, and T is 
in its original location. Stroking the “roll-down” key moves the data in the 
registers in th e direc tion from the Y register to the X register. Stroking the 
“roll-up” key Rf moves the data in the direction from the X register to 
the Y register. The data flow associated with the data entry and “roll- 
down” and “roll-up” operations is seen in Figure 1-5. 



Another commonly used stack manipulation is the replacement of the 
data in the X register with the data in the Y register and vice versa. T he 

data flow associated with stroking the “X-Y exchange” key /'"A is 

x y 

sketched in Figure 1-6. 

The data flow associated with the stack operations, when performing 




Figure 1-6 Data flow associated with x,y 


X 
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addition, subtraction, multiplication, and division, is sketched in Figure 
1-7. We see the following: 

1. For summation, the contents of the Y and X registers are added and 
displayed in the X register. 

2. For subtraction, the contents of the X register are subtracted from the 
contents of the Y register and displayed in the X register; 

3. For multiplication, the contents of the X register are multiplied by the 
contents of the Y register and displayed in the X register; and 

4. For division, the contents of the Y register are divided by the contents 
of the X register and displayed in the X register. 



Figure 1-7 Data flow associated with , x 
and +. 


For these basic four functions the contents of the T register are always 
retained and never lost. This feature of the operational stack is very useful 
for certain repeated calculations. 

It is worth pointing out here that many of the functions evaluated by a 
single key stroke on the typical Reverse-Polish with stacks pocket calcula¬ 
tors result in the loss of some data in the operational stack. For instance, in 
the HP35/45 series calculators, the contents of the T registers are lost 
when evaluating trigonometric functions. They are retained when evaluat¬ 
ing logarithmic and algebraic functions, such as taking the square root, 
taking the inverse, taking the logarithms, or exponentiating. 

Figures 1-8 and 1-9 illustrate the typical data flow in the stacks when the 
product of two sums and the sum of two products are evaluated using 
reverse-polish with stacks. Figure 1-8 a shows the usual procedure for 
evaluating the sum of products, which does not involve the use of the top 
register. To illustrate the flexibility of the reverse-polish with stacks and 
operations associated with the top register, Figure 1-8Z> shows the same 
calculations using the “roll-up” and “roll-down” features of the stack 
manipulations. Figures 1-10 and 1-11 present the typical data flow 
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Figure l-8a Data flow associated with the sum of two products (A X B) + (C x D) using key strokes 
A'\B'KC\DX + on a Reverse-Polish Machine. 
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Figure 1-8A Data flow associated with the sum of two products (A x B) + (C X D) using key strokes 
A^B^C^D XR|XR| + . 
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Figure 1-9 Data flow associated with (A + B)X{C+ D) using key strokes A^B+C^D+X on a 
Reverse-Polish machine. 
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Figure 1-10 Data flow associated with keyboard functions of an Algebraic machine with memory and hierarchy (“multiply before 
add”). 
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Figure 1-11 Data flow associated with (A + B)X(C + D) using key strokes A +2? = STOC+ Z)= xRCL= on an algebraic machine 
with memory and hierarchy (“multiply before add”). 
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associated with keyboard functions and a calculation of the product of two 
sums using algebraic with memory. A comparison of Figures 1-8, 1-9, 1-10, 
and 1-11 indicates clearly that the greater the memory storage capacity in a 
pocket calculator the greater the flexibility in its use. 

The question of languages in pocket calculators is akin to that in 
minicomputers or large computers, or different nationalities for that matter 
—the language you know the best is the language you like the most, unless 
you have sufficient multilingual skills to recognize the subtle advantages of 
one language over another. What matters least is the type of language or 
size of memory associated with any specific pocket calculator; what 
matters most is to begin to use some pocket calculator in advanced 
analysis. The solid-state revolution has enabled the engineer to perform 
fairly sophisticated analysis at his desk, in his home, or on a trip, without 
the need for access to a computing facility. Simply stated, those who 
capitalize on this aspect of the solid-state revolution and keep current with 
the development of pocket computing machines will have a tremendous 
advantage over those who do not. 


1.9 SCIENTIFIC KEYBOARD FUNCTION EVALUATION 

In this section we use the four-function calculator to evaluate the scientific 
functions normally found on the scientific calculator keyboard. The sine, 
cosine, tangent, exponential, logarithmic, arc sine, arc cosine, and arc 
tangent functions are presented in nested parenthetical forms in two 
different ways. The first way is in the nested parenthetical form of the 
truncated series approximations of these functions. The second way is in a 
curve-fit polynomial form that permits precision evaluation of these func¬ 
tions over a broader range than the simple series expansions of the 
functions. Also covered here are the algebraic functions of raising a 
number to its flth power or evaluating its nth root. 


Raising a Number to a Power 

Raising a number to a power on a four-function calculator can be done 
simply by repeated multiplication. A fairly high power, such as 100, 
involves 100 data entries and 100 multiplies, which result in many key 
strokes and many possibilities for error. An alternative is to use the 
constant key available on many of the four-function calculators. The 
constant key is built into these calculators to make it convenient to 
multiply or divide a series of numbers by a constant number. In the case of 
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raising a number to a power, we put the chain-constant switch to the 
constant position and then input the number to the X register, depress the 
constant key, and then raise the number to the power n by stroking the 
equals key n times. This approach virtually eliminates the error associated 
with repeated data entry in the primitive /7-multiply approach. Even in this 
case, however, raising a number to the power of 100 involves 100 depres¬ 
sions of the equals key (an error prone procedure). This can be circum¬ 
vented by breaking down the power into its prime factors and performing 
nested parenthetical multiplies to evaluate numbers raised to high powers. 
For example, suppose that we wish to raise it to the 100th power with only 
a single entry of it. This can be conveniently done on the simple four- 
function calculator by noting that the prime factors of 100 are 2, 2, 5, 5. 
Then 



= (((9.86960440) 2 ) 5 ) 5 
= ((97.40909108) 5 ) 5 

= (8.769956822 X 10 9 ) 5 
= 5.187848391 x 10 49 

On pocket calculators that have the squaring operator, numbers can be 
raised to any integer power through the 10th by entering the data only 
three times. Since the prime factors of many of our exponents are made up 
of 2, 3, 5, and 7, raising numbers to these powers involves only two data 
entries. 

Computing roots on the four-function calculator requires iterative opera¬ 
tions. Among the various approaches to evaluating roots, the simplest is 
Newton’s method. Though this method leaves much to be desired in most 
applications, it can be used conveniently for computing the roots of 
numbers. We have more to say on this topic later in the book. For now, 
note that the formula for computing the roots is 

Xk + '=~n{ X \fz +"- 1 ) 

where x k = kth estimate of n VA 
This equation requires an initial approximation, which is used to develop a 
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second, more exact, approximation, which in turn is again used to develop 
a third, even more exact, approximation. The process usually converges 
quickly when the initial estimate of the rcth root is known. Convergence 
can be markedly slow, however, when the first estimate is not fairly close 
to the root in question. Examples of the convergence properties of the use 
of Newton’s method for evaluating the third, fifth, and seventh roots of 77- 
are shown in Table 1-2. 


Table 1-2 Examples of the Convergence of Newton’s Method 
for nth Roots of it 


Number of 
Iterations 



Vn 

0 

1.0 

1.0 

1.0 

1 

1.713864218 

1.428318531 

1.305941808 

2 

1.499089493 

1.293620977 

1.209849586 

3 

1.465379670 

1.259260005 

1.180121812 

4 

1.464592311 

1.257280369 

1.177679333 

5 

1.464591888 

1.257274116 

1.177664031 

6 

1.464591888 

1.257274116 

1.177664030 

7 


— 

1.177664030 

Check by 

3.141592656 

3.141592658 

3.141592655 

computing tt 




7t (actual) 

3.141592654 

3.141592654 

3.141592654 

Absolute 

0.000000002 

0.000000004 

0.000000001 

error 





Close examination of Table 1-2 shows that for an initial guess of 1, the 
process converges in five iterations to the accuracy of the pocket calcula¬ 
tor. However, a determination of this requires a sixth iteration, and in the 
case of the seventh root also a seventh iteration. The table also shows the 
check of the root by repeated multiplies and the comparison with the true 
value of 77, indicating an accuracy of 1 part in 10 8 after only six or seven 
iterations. In general, this method cannot be expected to converge so 
quickly for other functions. It happens to converge quickly for the /tth root 
function because of the nice properties of that function. Note that New¬ 
ton’s formula for computing the ni\\ root also works for the simple square 
root. This can be seen by setting n equal to 2 in the equation. Then 
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Newton’s formula for computing the square root iteratively (which is also 
due to Joseph Raphson, a contemporary of Newton’s—hence this method 
is often called the Newton-Raphson technique) gives the equation 

(Vw >‘ +,= 4(VV) Jt +(VAr k) 

which can be used for iteratively computing the square root of a number 
N. 


Nested Parenthetical Forms 

Many functions of interest to engineers can be written in a power series. 
This series can be generated by using Taylor’s theorem, Maclaurin’s 
theorem, Chebyshev polynomials, and so on. Furthermore, an empirical 
data set can be fit with power series. When so written, they take the 
“standard” form 

f(x) — a 0 + a x x + a 2 x 2 + a 3 x 3 + • • • + a n x n + • • • (1-1) 

If we were to evaluate this series in the most straightforward manner on 
the simple four-function calculator, we would compute each term in the 
series and record it on a scratch pad. When all the terms of interest were 
evaluated, the sum would be computed on the pocket calculator. The 
number of key strokes involved for a 10-digit data entry is shown in Table 
1-3. The total number of key strokes for data entry plus instruction are 

n 

Total key strokes = 2 12/+ 10 = 6n 2 + \6n (1-2) 

i= i 


Table 1-3 Key Strokes Required to Evaluate 
Power Series in Standard Form 


Operation 

Key Strokes 

Record, 0 a 0 

0 

Compute and record, a x x 

22 

Compute and record, a 2 x 2 

34 

Compute and record, a 3 x 3 

46 

Compute and record, a n x n 

12/1 + 10 


"No one would input a 0 in the calculator and then 
recopy it on a scratch pad. 
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assuming that each data entry involved the full register. Clearly for n > 3 
the number of key strokes becomes laboriously large and for n > 5 the 
chances for error become enormous. By rewriting equation 1-1 in the form 

a 0 + x( ai + x(a 2 + x(a 3 + • • • + x(a n _ 2 + x(a n _ l + a„x)) • • •))) (1-3) 

we reduce the number of key strokes, because the formula is organized in 
the natural language of the machine and requires no scratch pad storage. 
In pocket computer instructions, equation 1-3 would be evaluated working 
from the inside out with the following instruction set: 



The number of key strokes for this evaluation of equation 1-1 is 

Number 1* 2 3 ••• n 

of terms 

Total key 0 33 55 10(2/7- \) + 2n -1 

strokes 

And the key strokes for data entry plus instructions total 

total key strokes =11(2/7-1) (1-4) 

We see, then, that in nested parenthetical form we can carry up to 10 
terms before the dimensions of the problem get out of hand—that is, up to 
six more terms than in the “standard” form. This business with the forms 
of equations is worth remembering for series evaluation on any calculator 
or computer in that the nested parenthetical forms are generally processed 
faster than are standard forms when computing time is involved. This is 
because the number of arithmetic operations grows as the square of the 
number of terms for series written in standard form and only proportional 
to the number of terms for series written in nested parenthetical forms. 

Note also that it is unnecessary to use a scratch pad when evaluating 
series in parenthetical forms, since the operands and operations are in the 


*No one would evaluate a single-term series on the calculator. 
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appropriate order for evaluation with algebraic, polish, or reverse-polish 
entry methods. 

Comparing equations 1-2 and 1-3 we see that the nested parenthetical 
form substantially reduces the number of key strokes by reducing the 
number of data entries required for the calculation. Even more dramatic is 
the impact that rewriting the equation in nested parenthetical form has on 
the time required to perform the numerical evaluation of the power series 
on a pocket calculator. If we assume that, on the average, for every key 
stroke and digit record the calculation takes 1 second, we would expect the 
nested parenthetical form to involve 22/(6 n + 32)% (for n> 5) of the time 
required for a standard-form power series evaluation. In general, nested 
parenthetical forms of power series or polynomials are more quickly 
evaluated than the standard forms. The more common scientific keyboard 
functions can be evaluated on the four-function calculator using the 
following nested formulas: 


In (1 + x) = x 1 — — 1 


2x 


2>x 


1 - — ( 1 
4 1 5 


4x 


(W<0 


, w q, + i(,4(,4(,.!)))). 


ln(*) = Wl~|l — 


2 


y = (x- 1), (\x- l| < 1) 


In 


x+\ 

jc — 1 


«-| 1 + 


e x =\ + x\ 1 + ^1 + 


1 

6x 2 
2x 


1 + 


1 + 


5x 2 

6x 


1 + 


lx 2 


1 + 


9x 2 


\x\ > 1 


24 


24 x[ ] m \20 
+ 120 ( + 720* 


sin (x) = x 1 —— 1 — 


6x 2 


cos(jc)ss 1 1 - 


120 

2x 2 _ 

24 


1- 


\20x* 

5040 


/, 5040 2 \\ 

\ 362880// 


ta n w^(. + T(. + iy(. 

cotanW-l-f(l + ^(l 


24x2 

720 


. 255 2 
+ 630" 


- 90 2 
945 X 


720 

40320' 
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arcsinW^l + £(' + ^(l + S* 1 )))’ W < ' 

arctan(x) = x| 1 - y-|l - -y |l - ^y- jjj, x 2 < 1 

arctan (j c)«f W>1 


These formulas were selected on the basis of the reasonableness of their 
intervals of convergence. The four different approximations for the natural 
logarithm span the region from x = — \ to +oo. They are all written in 
convenient nested parenthetical forms and can be used for immediate 
evaluation on the pocket calculator. This table, if copied and reduced, can 
be conveniently taped to the back of your pocket calculator for handy 
reference. 

Another approach to evaluating these scientific functions is to use a 
curve-fit polynomial over a broad range of the argument. Such poly¬ 
nomials are tabulated in Table 1-4 for the functions on the keyboard of 
the scientific pocket calculator. These polynomials will permit precise 
evaluation of the logarithmic, exponential, and trigonometric functions on 
the four-function calculator and thus make it capable of performing any 
analysis that can be performed on the scientific pocket calculator. 

To put functions into forms that are easily computed on the pocket 
calculator, use the following procedures: 


Procedure 1 

(a) Either find or generate a table of values for the function of interest 
to the accuracy of interest. 

(b) Prepare an interpolating polynomial (see Chapter 2) that passes 
through selected points of interest in the table but spans the range 
of interest in the argument. 

(c) Identify the maximum error of the polynomial approximation on 
the interval of interest. 

(d) If the accuracy is satisfactory, write the polynomial in nested 
parenthetical form, and then use it for approximate evaluation of 
the function on the pocket calculator. 



Table 1-4 Polynomial Approximations of Many Functions Found on the Keyboard 
of the Scientific Pocket Calculators 


(1) Log 10 (x) = t(a x + t\a 3 + t\a 5 + t\a n + a 9 2 )))) + e(x) 

Here 

r = (x-l)(x+l) _1 

and 

|€(x)|<10 -7 where 10"" 1//2 < x < 10 + 
for 

a x =0.868591718 a 2 = 0.094376476 

a 3 = 0.289335524 a 9 = 0.191337714 

a 5 = 0.177522071 

(2) Log 10 (x) = t(a x + a 3 t 2 ) + e(x) 

where t — (x- l)(x + l) -1 , a x = 0.86304, and a 3 ~ 0.36415. 

Then 

|e(jc)| < 6X 10 -4 where 10 _ 1/2 < x < 10 +1/2 

(3) Ln(l + x)=x(a, + x(a 2 +x(a 3 + x(a 4 +a 5 x)))) = c(x) 

Here 

a x = 0.99949556 a 4 = -0.13606275 

a 2 = -0.49190896 a 5 = 0.03215845 

a 3 = 0.28947478 

Then 

|e(x)| < 10~ 5 where 0 <x<l 

(4) Ln(l + x) = x(a x + x(a 2 + x(a 3 4- x(a 4 + x(a 5 + x(a 6 + x(a 7 + a s x))))))) + e(» 
Here 


a i = 

0.9999964239 

a 5~ 

0.1676540711 

a 2 = - 

-0.4998741238 

a 6 = ■ 

-0.0953293897 

a 3 = 

0.3317990258 

a~i — 

0.0360884937 

a 4 = 

0.2407338084 

a 8 = 

0.0064535442 


and 

|e(x)| < 3 X 10 -8 where 0 < * < 1 

(5) e~ x — 1 + x(a x +a 2 x) +f.{x) 
where 

a j=— 0.9664 and « 2 = 03536 
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Then 

|e(x)| < 3x 10 -3 where 0<x<ln2 

(6) e~ x = 1 + x(a x + x(a 2 + x(<3 3 + a 4 x))) 
where 

a, = - 0.9998684 a 3 = - 0.1595332 
a 2 = 0.4982926 * 4 = 0.0293641 

Then 

|e(x)| < 3 X 10~ 5 where 0 < jc < In2 

(7) Sin(x) = x(l + x 2 (a 2 + a 4 x 2 ))+ xe(x) 
where 

a 2 = -0.16605 and a 4 = 0.00761 

Then 

|e(x)| <2X 10~~ 4 where 0<x<y 

(8) Sin(x) = x(l + x 2 (a 2 + x\a 4 + x 2 (a 6 + x 2 (a s + a l0 x 2 ))))) + xc(x) 
where 

a 2 =- 0.1666666664 a 8 = 0.0000027526 

a 4 = 0.0083333315 a w = -0.0000000239 

a 6 = -0.0001984090 

Then 

|e(x)| <2X 10 -9 where 0<x<y 

(9) Cos(x) = 1 + x 2 (a 2 +a 4 x 2 )+e(x) 
where 

a 2 = -0.49670 
a 4 = 0.03705 

Then 

|e(x)| <:9X 10~ 4 where 0<x<y 

(10) Cos(x) = 1 + x\a 2 + x 2 (a 4 + x 2 (a 6 + x 2 (a 8 + a xo x 2 ))))+ e(x) 
where 

a 2 = — 0.4999999963 a 8 = 0.0000247609 

a 4 = 0.0416666418 a 10 = -0.000002605 

a 6 = -0.0013888397 
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[t( jc)|< 2x10 9 where 0<x<y 

(11) Tan(x) = .x(l + x 2 (a 2 + a 4 x 2 )) + xe(x) 
where 

« 2 = 0.31755 
a 4 = 0.20330 

Then 

|e(x)| < 10”" 3 where 0<x<^ 

(12) Tan(x) = x(l + x 2 (a 2 + x 2 (a 4 + x 2 (a 6 + x 2 (a s + x 2 (a i0 + a n x 2 ))))))+ xe(x) 
where 

a 2 = 0.3333314036 a s = 0.0245650893 
a 4 = 0.1333923995 a 10 = 0.0029005250 

a 6 = 0.0533740603 a l2 = 0.0095168091 

Then 

Je(jc)l <2x 10 -3 where 0<;c<^ 

(13) Cotan(x) = + x\a 2 + a 4 x 2 )) + 

where 

a 2 = -0.332867 
a 4 = -0.024369 

Then 

|e( jc)| < 3 X 10“ 5 where 0 <jc<-^ 

(14) Cotan(x)= -^(1 + x 2 (a 2 + x 2 (a 4 + x 2 (a 6 + x 2 (a 8 + a 10 x 2 )))))+ —~ 
where 

a 2 = -0.3333333410 a 8 = -0.0002078504 

a 4 =- 0.0222220287 a w = - 0.0000262619 
a 6 = 0.0021177168 

Then 

|e(jc)| < 4x 10“ 10 where 0<x<-|^ 
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Table 1-4 fContinued) 

(15) Arcsin(jc)= y -(1 — x) X/ \a 0 + x(a x + x{a 2 + a 3 x))) + e(x) 
where 

a 0 = 1.5707288 a 2 = 0.0742610 

a x = -0.2121144 <* 3 =-0.0187293 

Then 

|e(x)| < 5 X 10 -5 where 0 <jc<1 

(16) Arctan(x) = x(a } + x 2 (a 3 + x\a 5 + x 2 (a 7 + a n x 2 )))) + t(x) 
where 

a,= 0.9998660 a 7 =-0.0851330 

a 3 = —0.3302995 a 9 = 0.0208351 

a 5 = 0.1801410 

Then 

|e(x)| < 10~ 5 where - 1 < jc < 1 


If the tables are not available and there is not sufficient time to prepare 
them, use Procedure 2. 

Procedure 2 

(a) Prepare a series approximation of the function centered on the 
interval of interest. 

(b) Using a Chebyshev polynomial economization scheme (see Chapter 
8) reduce the order of the polynomial. 

(c) Test the polynomial for accuracy over the argument’s interval of 
interest. 

(d) If the polynomial is not sufficiently accurate, include more terms in 
the original approximating polynomial before Chebyshev 
economization, then use the Chebyshev procedure and test the 
polynomial again. 

(e) When the polynomial is sufficiently accurate, write it in nested 
parenthetical form and use it to evaluate the function on the pocket 
calculator. 

The numerical methods associated with generating interpolating 
polynomials are discussed in Chapter 2. The Chebyshev economization 



Accuracy in Function Evaluation 


39 


procedure and approximation with rational polynomials are discussed in 
Chapter 8. 

An interesting aside is that the logarithmic, exponential, and transcen¬ 
dental functions and their inverses and hyperbolic counterparts are typi¬ 
cally generated in pocket calculators with pre-programmed, recursion 
algorithms. These algorithms generate the numerical values of these func¬ 
tions using CORDIC techniques*. A CORDIC technique does not im¬ 
plement series expansion approximating polynomials. They are hardware 
algorithms that generate the numerical values of the mathematical func¬ 
tions in which we are interested. In a word, function evaluation on the 
pocket calculator is done to high precision using computing techniques and 
algorithms that are convenient and efficient from a circuit implementation 
viewpoint more than an analytical viewpoint. 


1-10 ACCURACY IN FUNCTION EVALUATION 

Books on numerical analysis or computer calculations usually present the 
equations for propagating relative or absolute error through an analysis. In 
this book we take a slightly different approach. Our concerns here are 
working within the limitations of the pocket calculator’s computing capa¬ 
bility and understanding the calculator’s impact on the generation of error 
that gets introduced into the problem. We wish to identify methods and 
techniques for getting around these problems. 

The floating-point number system affects the calculations on the pocket 
calculator through its treatment of overflow and underflow. When a 
number exceeds the largest number in the calculator, the calculator is 
usually set to its largest number and the calculation is set to overflow the 
contents of the calculator. Similarly, when the calculation calls for a 
number that is smaller than the smallest number in the calculator, the 
number usually is set equal to zero and the calculation is set to underflow 
the machine’s capability. Intuitively, replacing an underflow by zero seems 
more reasonable than replacing an overflow by the maximum number 
available in the calculator. However, one must be careful in such generali¬ 
zations. Computing e 228 using the inverse of e~ Z2B is not the same as 
evaluating c 228 directly. The reason is that e -228 is set equal to zero and 
thus the inverse is undefined, while e 228 is within the number system of the 
pocket calculator. 


1 


e 


-228 


1 


0 


underflow 


^undefined 


e 228 = 1.045061560 X 10" 


*The Cordic trigonometric computing technique—IRE Transactions on Electronic Computer, 
September 1959. 
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What is surprising is that these number system “end effects” can lead to 
some practical limitations on the range of variables for which the function 
can be evaluated. Table 1-5 shows the effect of overflow and underflow on 
the range of the function x 5 e x /(e x — 1). 


Table 1-5 The Effect of Overflow and Underflow on the 
Range of Function Evaluation 


X 

x J e x / e x — 1 

x 5 /\-e~ x 

1 

1.581976707 

1.581976707 

10 

1.000045407 X10 5 

1.000045407 XI0 5 

100 

1.0X10 10 

1 x 10 10 

200 

3.200000023 X10 11 

3.200000023 X10 11 

202 

3.363232171 X 10 11 

3.363232170X10 11 

203 

3.447308829 X10 11 

3.447308829 X10 11 

204 

Overflow 

2.533058573 X10 11 

220 

Overflow 

5.153631990X10 11 

225 

Overflow 

5.766503900 X10 11 

226 

Overflow 

5.895792594 X 10 11 

227 

Overflow 

6.027389914 X 10 11 

228 

Overflow 

Underflow 


The function is written in two ways in the table: favoring underflow and 
favoring overflow. That is, the function in the first column will eventually 
overflow the calculator’s field of numbers because of the evaluation of 
x 5 e x , while the function in the second column will eventually underflow 
the calculator’s field of numbers because of the evaluation of e~ x . The 
table shows that the range of the variable x for which the function can be 
evaluated is limited sooner by the overflow effect than the underflow 
effect. In fact, the function written in the form that will eventually result in 
underflow can explore the range of the argument which is 12% greater than 
the same function that will eventually result in overflow. In general, pocket 
calculator analysis favors functions written in the form that will eventually 
underflow. 


Roundoff Error 


Roundoff error is similar to the end effects associated with underflow and 
overflow. While many understand roundoff, its practical impact on en¬ 
gineering-type calculations is often ignored with occasionally surprising 
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results. Because some of the modern pocket calculators display mantissas 
to 13 places, it is easy to overlook the roundoff effect in a calculation, 
thinking that the calculator’s large mantissa will certainly maintain ac¬ 
curacy through a sequence of calculations. The question here, then, is not 
how to round off a calculation but, rather, how does the roundoff intro¬ 
duce error in a practical manner in a calculation? Roundoff is an end 
effect. It is similar to underflow and overflow in that the last digit in the 
mantissa is arbitrarily changed to another number on the basis of some 
rationale. It is different from underflow and overflow effects: end effects 
associated with the number system in the calculator impact the range of 
the argument that can be examined; roundoff does not. Roundoff can 
actually propagate error into the most significant digits of the calculation. 
One might ask, “How does the roundoff of a three or four significant digit 
number propagate into the most significant digit?” This is precisely what 
we shall discuss here; an example of how roundoff in the third significant 
digit propagates to the first significant digit resulting in a 100% error is 
used to illustrate the problem. 

Table 1-6 shows the calculation of the difference between products of 
numbers known accurately to three significant digits. 


Table 1-6 Error from the Least Significant Digits to the most Significant Digits 


Desired 

Calculation 

Calculator 

Results 

Rounded 

Calculator 

Results 

Calculator 

Results 

Rounded 

0.234X0.567 

0.132 678 

0.133 

0.132 678 

-0.232X0.567 

-0.131 312 

-0.131 

-0.131 312 

O.xxx 

1.366 X 10 -3 

2X 10~ 3 

lx!0~ 3 


Column 1 shows the desired calculation. Column 2 shows the results 
achieved on a pocket calculator, and column 3 shows the results achieved 
by first rounding each of the numbers generated in the product and then 
taking the difference. Column 4 shows first taking the difference between 
the unrounded numbers and then performing the rounding operation. 
Precisely what we mean here by rounding is the following. When the two 
three-digit numbers are multiplied, their product has either five or six 
places. Because the original numbers are only known to three places, we 
must drop two or three digits from the product. The rounding operation is 
adding 1 in the third place if the fourth-place digit is five or greater, or 
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adding zero to the third digit if the fourth digit is less than five. 

Now let us examine Table 1-6 closely. The desired calculation involves 
roundoff because the numbers in the products are only known to three 
places. The calculator results that are displayed to five or six places are 
really only known to three places and thus the number must be rounded. 
The result of taking the difference of the unrounded numbers is 1.366 X 
10~ 3 which is only accurate to the first digit. If the second, third, and 
fourth digits are retained in additional calculations, they introduce a 
multiplication error into the problem that is propagated forward in any 
calculations. Clearly, the propagation of this type of error in an extended 
calculation can provide meaningless results. This roundoff error is well 
known and is not commonly made by most analysts. 

It is the errors associated with the third-column calculations that are 
occasionally introduced into calculations. They arise from what seem to be 
reasonable calculations but are in fact mathematically incorrect and thus 
introduce substantial errors. The results in the first row of calculations are 
rounded to the third significant digit before the subtraction is performed, 
giving 2xl0“ 3 . Column 4 shows the subtraction being performed before 
the roundoff is performed. It is apparent that the difference in the two 
calculations is a factor of 2 (100% difference in the two numbers). The 
rationale for the calculation of column 3 is that we really only know the 
number to three significant digits, and thus should round each product 
before subtracting. The rationale for the calculation in column 4 is that 
rounding arbitrarily changes one of the numbers in the calculation, which 
introduces an end-effect error. In column 3 there are two end-effect errors 
which can combine into a sizable resultant error, while in column 4 only 
one end-effect error occurs when the products and subtractions are com¬ 
pleted and the result is rounded. In this sense, then, if a column of n 
products are taken followed by n subtractions, where roundoff is per¬ 
formed after the multiplication, there are n opportunities for propagating 
the roundoff effect from the third significant digit to the first significant 
digit. However, if the roundoff is performed after the subtractions are 
made, there is only one opportunity for propagating this roundoff error 
forward into the most significant digits. Thus the rule of thumb for 
accurate calculations is to roundoff on the last step. The example chosen 
here carries the roundoff error immediately from the last significant digit 
to the first significant digit, which usually is not the situation. It is worth 
pointing out, however, that calculations to one part in 10,000 involving 
differences can move roundoff error as much as three significant digits 
forward, thus modifying sensitivity analysis (evaluation of derivatives with 
finite differences) results in the third and even second places. 

In summary, roundoff becomes a problem in calculations mainly when 
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two numbers of the same size are subtracted. The roundoff propagates 
forward as a result of the cancellation of the leading digit in a subtraction 
process. This brings the roundoff errors from the least significant digit into 
the most significant digits. Because the display in the calculator’s display 
window shows a mantissa to 13 places, the inexperienced analyst can be 
“spoofed” into assuming incorrectly that he has an accurate number. 

Unfortunately, there is no systematic approach to analyzing the effect of 
roundoff in extensive calculations. All that can be said is that care must be 
taken not to write equations in forms leading to differences of equal-size 
numbers. Even this is difficult, because the values of the parameters of the 
problem that result in the difference of equal-size numbers often are not 
apparent, so that significant roundoff error cannot easily be predicted. The 
only practical resolution is to strive to write equations in forms that 
minimize the use of subtraction. 


Relative Error 

As already mentioned, the absolute error in the fixed-point number system 
is fixed, while the relative error in a floating-point number system is fixed. 
That is, the difference between two numbers in the fixed-point number 
system is always the same; in the floating-point number system it is not. 
The difference between two floating-point numbers, when the numbers are 
close to zero, is smaller than the difference between two floating-point 
numbers when the numbers are close to the maximum size in the calcula¬ 
tor. The difference between two numbers divided by either of the numbers 
is approximately fixed in the floating-point number system, while it varies 
in the fixed -point number system. Thus the floating-point number system 
tends to emphasize relative rather than absolute error, as do most engineer¬ 
ing and scientific analyses. Hence it is the natural number system for 
scientific calculations. 

A similar situation occurs in evaluating functions. Scientific pocket 
calculator analysis favors functions written in a form that minimizes 
relative error rather than absolute error. Although this is well known to the 
experienced analyst, and seems quite rational to the practical analyst, we 
still find it prevalent to use “absolute error” as an accuracy criterion in 
numerical analysis. For example, calculating e~ x over the range 0 to 3, to 1 
part in 10 3 using a Taylor series expansion, requires on the order of 12 
terms in the series. That is, the contribution made by the thirteenth term in 
the Taylor series expansion of e~ x , when x = 3, is something less than 
10 -3 . However, if, more reasonably, we require that the relative error be 1 
part in 10 3 , only nine terms are needed in the series. The absolute error 
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criterion requires 30% more terms than what is usually required for 
engineering analysis. In general, when deriving approximation formulas, it 
is important to decide what type of error is important to the problem being 
solved and use approximation methods that provide the appropriate ac¬ 
curacy. Too often the approximation is laboriously long and too accurate 
for the purpose. 

In Chapter 4 we evaluate power series forms of advanced mathematical 
functions such as Bessel’s functions and Legendre polynomials. Our 
emphasis then in these cases, is on both relative error and absolute error. 
The formulas based on relative error criteria have fewer terms than those 
based on absolute error. In scientific work (where relative error is of 
concern) this results in a significant reduction in the work required to 
evaluate these functions on the pocket calculator because the number of 
key strokes involved in raising the argument to high powers is eliminated. 


Rearranging Expressions to Minimize Error in Function Evaluation 


The pocket calculator’s sole function is the numerical evaluation of 
mathematical functions. Hence it does not have alpha-numeric displays or 
the ability to display words, except by coincidence.* Its purpose is to 
evaluate functions. We have already examined the effects of underflow 
and overflow, roundoff, and the error criterion itself on the accuracy of 
numerical evaluation. Now we briefly look at the functions to be evaluated 
and how they may be written in forms where loss of accuracy due to the 
subtraction of two almost equal-sized numbers does not occur. 

There are a number of "‘tricks” to handling the difference between two 
numbers that are close together. However, one general technique exists 
that can resolve many problem situations where the difference of two 
numbers are of approximately the same size. Consider the function 


h(x)=f(x + e)-f(x) 

As already discussed, the numerical evaluation of h(x) can propagate 
roundoff error forward into the leading significant digits. This function can 
be modified as 


/?(*)= {/(* + €)-/(*)} 


f(x + e)+f(x) 1 
f(x + e)+f(x) J 


h(x) 


f 2 (x + z)-f 2 (x) 
f(x + e)+f(x) 


See Appendix A. 
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This is a general equation that can, for algebraic and certain transcen¬ 
dental functions, transform the difference of two neighboring numbers into 
the ratio of sums of the numbers capable of being evaluated accurately on 
the pocket calculator (or on any calculator or computer). For example, if 


f(x) = x 2 

then 


/t( , (* + €) 4 --x 4 _ 4x 3 e 

2x 2 + 2xe + e 2 ~2x 2 (l + e/x) ~ 

For another example, consider the function 


Then 


/(x) = sin(x + e) 


h(x) = sin (x + e) — sin (x) = 2 cos ^x -I- ^ ) sin ( ) 

which for small e (but not necessarily small x) is 



An example suggested by Hamming is 


(* + e , l/ 2 -W l/2 = 


[(x +£ ) ,/ 2 - W ,/2 ][(, +£ ) ,/2 +w ,/2 ] 


( X + c )' /2 + ( X ) l/2 


(jc + «) i/2 + (*) ,/2 


With regard to other techniques, Hamming makes the interesting obser¬ 
vation that what appear to be a large number of tricks to reformulate a 
function to handle its finite difference are really not new to the analyst. 
They are exactly the same methods used in calculus to derive the func¬ 
tion’s derivative. We can see this from the definition of the derivative 


lim 

Ax—>0 


Ay 

Ax 


lim 

Ax—*0 


f(x + Ax)-f(x) ) 
A\ j 


As a final resort to avoiding subtraction of nearly equal-sized numbers, 
most functions can be series expanded or approximated with different 
types of series for the interval of interest. Then h{x) can be formed and 
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modified as before to get around the subtraction problem. 

An approach that works surprisingly well for certain functions (see 
Example 1-4) is to use the mean value theorem of differential calculus, 
where 


m-f(a) = (b-a)f\8), ( a<9<b) 

As an example of the application of the mean value theorem, let us 
compute 


h ( x) = sin (x + f) — sin (x) 

where x + € is not necessarily small. Using the mean value theorem, we 
then find 


for 


h(x ) = |(x + e) — (x)]cos(#) = e cos# 


x 4- e < # < x 

The difficulty is in selecting the value of # that will accurately compute 
/(x); that is, in selecting 9 that produces less error than would be produced 
by the propagation of the roundoff error into the most significant digits. 
The author knows of no method for effectively estimating 9 to ensure 
accuracy greater than is given by taking the difference itself. However, the 
midvalue interval is an obvious possibility. In this case, we find 

h(x) = €COS^X + ^ 


Clearly this method is of questionable value (for precision evaluation) ex¬ 
cept when# can be determined.The equation is useful, however, for com¬ 
puting the extreme values of the difference by using the expressions 

ccos(x + e) 
c cos(x) 

on the interval of the calculation. 

A few commonly used difference equations for circumventing large 
errors in taking the difference between nearly equal values of popular 
transcendental functions are tabulated in Table 1-7. 
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Table 1-7 Commonly Used Difference Equations 
in Functional Evaluation 


Ae x = e x (e* x -\) 


Ain (jc) = In ^1 + ~~ j 
A sin (Inx) = 2sin(7rAjc)cos 




Acos( 277 -jc) = — 2sin(77-Ajc)sin ^ 27 t^jc + y j J 
A tan (2vx) — sin {lirSx) sec (277- jc) sec(27rjc + 2ttSx) 
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1-11 REFERENCES 

One comprehensive, readable volume was selected to use as a reference 
throughout this book: Richard Hamming’s Numerical Methods for Scien¬ 
tists and Engineers. This book, published by McGraw-Hill, is in its second 
printing (1973). Dr. Hamming has also published a superb textbook 
entitled Introduction to Applied Numerical Analysis (McGraw-Hill, 1971). 
For this chapter refer to Hamming’s Numerical Methods for Scientist and 
Engineers , Chapters 2 and 3. 

Example 1-1 Evaluate ln(0.9) using the fifth-order truncated Taylor series 
expansion of ln(l + x) in the neighborhood of x—\. 

'"„^>-('-f(‘-T('-T('-f))))- M<. 

Now 

1 + jc = 0.9 
.\* = - 0.1 

Then 

ln(0.9)«-0.l(l - _ 2xai(, _ 3*M(, _ ixMjj 

A typical algebraic key stroke sequence for evaluating this polynomial is 
4x0.1 CHS- 5+ 1 X 3x0.1 CHS -4+ 1 x 2x0.1 CHS- 3 


+ 1 X 0.1 CHS-2+ 1 x 0.1 CHS = 
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A typical reverse polish key stroke sequence is 

4|0.lx5 + CHSl+ 3x0.1 X4-CHS1+2x0.1 x3- CHS1 

+ 0.1 x 2 CHS 1 + 0.1 X CHS 

Accuracy considerations over a broader range of jc are given in Table 
1 - 8 . 


Table 1-8 Accuracy of the Fifth-Order Taylor Series Expansion of In (1 + x) 


(1+x) 


ln(l+x) x[\ - x/2 (\- • • 

)] 

Absolute 

Error 

Relative 
Error (%) 

0.9 

-0.1 

-0.10536052 

-0.10536033 


-0.00000018 

00.000173 

0.8 

-0.2 

-0.22314355 

-0.22313067 


-0.00001288 

00.005774 

0.7 

-0.3 

-0.35669494 

-0.35651100 


-0.00016394 

00.04596194 

0.6 

-0.4 

-0.51082562 

-0.50978133 


-0.00104429 

00.20443190 

0.5 

-0.5 

-0.69314718 

-0.68854167 


-0.00460551 

00.80241261 

0.4 

-0.6 

-0.91629073 

- 0.89995200 


-0.01633873 

01.78313839 

0.3 

-0.7 

- 1.20397280 

- 1.15297233 


-0.05100047 

04.23601512 

0.2 

-0.8 

- 1.60943791 

- 1.45860264 


-0.15083525 

09.37192077 

0.1 

-0.9 

-2.30258509 

- 1.83012300 


-0.47246209 

20.51876799 

Example 

1-2 

Evaluate ln(l 

+ x) using 

the 

fifth-order 

Chebyshev 


approximating polynomial 

ln(l + jc) = x(a x + x(a 2 + x(a 3 + jc(<2 4 + a 5 x)))), 0 < x < 1 


over the range 0 < x < 1 using the coefficients (from page 55) 


a { = 0.999949556 a 4 = -0.13606275 

a 2 = -0.49190896 a 5 = 0.03215845 

a 3 = 0.28947478 

The accuracy of this approximation is shown in Tables 1-9 and 1-10. 

Note that even outside the region where the approximating polynomial 
was designed to best approximate ln(l + x) it is more accurate than the 
“unconditioned” Taylor series expansion of ln(l + jc). Evaluation of ln(l + 
x) using the approximating polynomial requires approximately 60 key 
strokes (20 more keystrokes than are used in the Taylor series approxima¬ 
tion) whether using the reverse-polish or algebraic languages. The addi- 
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Table 1-9 Accuracy of the Fifth-Order Chebyshev Polynomial Approximation 
of ln(l + x) 


(1 + x) 

X 

ln(l + x) 

x[a 1 + x(a 2 + • ■ •)] 

Absolute 

Error 

Relative 
Error (%) 

1.1 

+ 0.1 

0.09531018 

0.9530666 

0.00000352 

0.003697 

1.2 

+ 0.2 

0.18232156 

0.18233114 

0.00000959 

0.005257 

1.3 

+ 0.3 

0.26236426 

0.26236872 

-0.00000445 

-0.001697 

1.4 

+ 0.4 

0.35647224 

0.33646527 

0.00000696 

0.002070 

1.5 

+ 0.5 

0.40546511 

0.40545592 

0.00000919 

0.002267 

2.0 

+ 1.0 

0.69314718 

0.69315708 

0.00000990 

-0.001428 


Tablel.10 Accuracy of the Fifth-Order Chebyshev of ln(l + x) Outside the 
Design Range of the Chebyshev Approximation 


1 + X 

X 

ln(l + x) 

x[a x + x{a 2 + • • •)] 

Absolute 

Error 

Relative 
Error (%) 

0.9 

-0.1 

-0.10536052 

-0.10517205 

-0.00018847 

0.178879 

0.8 

-0.2 

-0.22314355 

-0.22211926 

-0.00102429 

0.459028 

0.7 

-0.3 

- 0.35667494 

-0.35311655 

-0.00355840 

0.997658 

0.6 

-0.4 

-0.51082562 

-0.50084255 

-0.00998309 

1.954301 

0.5 

-0.5 

-0.69314718 

-0.66841824 

- 0.02472894 

3.567632 

2.1 

1.1 

0.74193734 

0.74210824 

-0.00017089 

0.023034 

2.2 

1.2 

0.78845736 

0.78913899 

-0.00068162 

0.086450 

2. 3 

1.3 

0.83290912 

0.83478743 

-0.00187831 

-0.225512 

2.4 

1.4 

0.87546874 

0.87972822 

-0.00425948 

-0.486537 

2.5 

1.5 

0.91629073 

0.92481112 

-0.00852039 

-0.929878 


tional key strokes are associated mainly with entering the coefficients a v 
a 2 ,...,a 5 . 


Example 1-3 Rewrite the difference 


*w- 


i 

X + 1 




in a form that will minimize roundoff error using a series expansion 
technique. The objective is to eliminate the differencing of two numbers of 
approximately equal size. Expanding the first term, we see that 


1 _ */•* 1 (, i i_L 

X + 1 1 + 1/x X\ X x 2 x 3 


+ • • • 


X I > 1 
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Then 




1 

X 




1-U-L--7 + 

•* x 1 x 3 


h(x) 



h( ' x) x 2 "(l + l/*) *(x+l) 

This form of h(x ) does not involve computing the difference of two 
numbers of nearly equal size. The range of x over which this derivation 
applies is |x| > 1. 


Example 1-4 Rewrite the difference 


in a form that will minimize roundoff error using algebra. 
Cross-multiplying, we find 


x-(x + 1) _ \ 

^ * x(x+l) x(x+l) 

This result is the same as that developed with the series expansion method 
except that it holds for all x, not just |x| > 1. This is an important point to 
remember. Derivations using series expansion techniques often lead to 
results that hold over a greater range of the independent variable than their 
derivation strictly allows. With a pocket calculator it is easy to check the 
dynamic range over which a derived formula will work. 


Example 1-5 Estimate sin(31°) —sin(30°) using the mean value theorem. 
By the mean value theorem, we obtain 

h(x) = sin(30° + 1 °)-sin(30°)^0.017453293 cos(30.5°) 
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Here 0.017453293 is the value of 1° in radians. Then: 

0.017453293 cos(30.5°) = 0.015038266 
sin(31 °) - sin(30°) = 0.015038075 
relative error (%) = -0.0012700 
absolute error = —0.000000191 

Table 1-11 indicates that the mean value theorem can be useful for 
engineering evaluations, since the relative error is very small. Care must be 
taken, however, in using the mean value theorem. Had we used cos(30°) 
instead of cos(30.5°) we would find 

0.017453293 cos(30°) = 0.015114995 

where actually 

sin(31°) - sin(30°) = 0.015038075 
absolute error = -0.000076920 
relative error (%) = —0.5115024 


Table 1-11 Accuracy of Mean Value Theorem Approximation of 
sin (0 + 1°) — sin0 


9 

(degrees) 

Sin 

(9+ l°)-sin 9 

Mean Value 
Theorem 

Absolute 

Error 

Relative 
Error (%) 

0 

0.017452406 

0.017452628 

-0.000060222 

-0.0012693 

10 

0.017160818 

0.017161036 

-0.000000218 

-0.0012693 

20 

0.016347806 

0.016348014 

-0.000000208 

-0.0012604 

30 

0.015038075 

0.015038266 

-0.000000191 

-0.0012700 

40 

0.013271419 

0.013271588 

-0.000000169 

-0.0012708 

50 

0.011101518 

0.011101659 

-0.000000141 

-0.0012723 

60 

0.008594304 

0.008594412 

-0.000000109 

-0.0012629 

70 

0.005825955 

0.005824029 

- 0.000000074 

-0.0012695 

80 

0.002880587 

0.002880624 

-0.000000037 

-0.0012756 

90 

0.000152305 

0.000152307 

-0.000000002 

-0.0013714 
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Had we used cos(31°) instead of cos(30.5°), we would find 

0.017453293 cos(31°) = 0.014960392 
sin (31°) - sin(30°) = 0.015038075 
absolute error = 0.000077683 
relative error (%) = 0.5165751 

Here we see that the relative error at the boundaries of the 6 interval has 
jumped from ~t^o% when 6 is taken at the midvalue of the interval to 
when 9 is taken at the end value of the interval. 



CHAPTER 2 


DIFFERENCE TABLES, 
DATA ANALYSIS, AND 
FUNCTION EVALUATION 


2-1 INTRODUCTION 

This chapter deals with interpolation, extrapolation, and smoothing of 
tabulated data. Many books on numerical analysis discuss these topics as 
related to the use of mathematical tables. Though we are interested in the 
use of these methods for precision table lookup, this chapter aims mainly 
to develop functions that are simple in form that can be used to replace 
complex functions. This technique, called analytic substitution, is com¬ 
monplace in advanced analysis. For example, cost data developed on 
computer programs with as many as 500 cost-estimating relationships 
(CERs) can be used to generate a table of costs as a single design 
parameter is changed. It is often convenient to develop an interpolation 
formula based on the table of discrete costs which will compute system 
cost as a function of the single design parameter. The simpler formula can 
be analytically substituted for the entire complex system of CERs in the 
large-scale cost model. This reduces the cost of “cost estimating” and 
makes the simplified models convenient to analyze on the pocket calcula¬ 
tor (see Chapter 11). We will also investigate the smoothing of tabulated 
data on the basis of estimates of the error propagated in a difference table. 
Finally, we will study what is perhaps the most important but seemingly 
least developed use of data tables, extrapolation or prediction. Here 
projections, predictions, and identification of trends and predicted values 
of function are discussed both from the viewpoints of mathematical 
limitations and the practical necessity to predict the behavior of dynamic 
processes from their data tables. 
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2-2 DIFFERENCE TABLES OF EQUALLY SPACED DATA 

Before the age of pocket calculators, the preparation of extensive 
difference tables of data with mechanical calculators was laborious and 
noisy at best, and frustrating at worst. In practical analysis they require 
carrying numbers to at least as many as five significant digits. A table of 
finite differences of n numbers and m differences requires 

m(2n — m — 1) 

2 

differences to be calculated and recorded in the difference table. For a 
table of 50 entries and 5 differences, this involves 235 differences to be 
computed. Thus 470 data entries must be made, which took about an hour 
on the old mechanical calculators. On the electronic pocket calculator 
these calculations are done quickly and quietly, with the time-limiting 
element being the analyst’s preparation of the difference table. Tables of 
fifth-order differences of 50 numbers can be conveniently prepared in 
approximately 15 minutes with any pocket calculator. 

The difference tables that we are concerned with here are usually 
generated in two ways. Either a function is evaluated for certain values of 
its independent variable or data are determined by measurement of an 
experiment. In both cases the tables of equally spaced data can usually be 
prepared, especially of data determined from experiments, since much of 
experimental electronics and data sampling is done digitally and can be 
time-referenced to a digital clock. We discuss arbitrarily spaced data later 
in this chapter. 

Our notation is based exclusively on the definition of the forward 
difference: 


hyi=y i+] -yi=y(xo+[i+ \}&x)~y(x 0 -Hhx), / = 0,1,2,...,/? 


Figure 2-1 illustrates the definitions of the differences involved in the 
difference table. Occasionally we use the term h to represent the spacing of 
the data, that is, A x~ h — x n+{ — x n . We do not use backward differences 
or central differences in this book. Backward and central differences are 
only useful for changing the form of equations used in the derivation of 
numerical approximation methods. Since our interest here is not in 
manipulating equations but in their numerical evaluation, we use only the 
forward difference notation. Repeated application of the definition of the 
forward difference generates higher-order differences. For example, the 
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Figure 2-1 Definition of differences of equally spaced data. 


second-order difference is derived as 

A^Ay^-Ay, 

=/,+2->’«+1 -Oi+i - y t ) 

= ^, + 2 — 

The third-order difference is developed as 

AV = AV +1 - AV 

= Ay, +2 - Ay i+ , - (Ay, + , - Ay,) 

= (y,+3-y l+2 ) - (x+2~>’,'+1) ~ (y,+2~y,+ 1) + Owi -yi) 

=>'/ + 3- 3 >'/ + 2 + 3 A+l-^ 

The differences can be numerically evaluated using the equations just 
developed, or they can be computed directly from the tabulated values of 
the dependent variable, as shown in Figure 2-2. 

The nth difference operator is given by the formula 


A" = (z — 1)” = z " — «z' 


l + 


n{n-\) 


-z — 


2 


( 2 - 1 ) 
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y 

Ay 

w 

<1 

<1 

*0 

7 0y 

o 

<] 



*1 

y i 

<] 

V 

>>o 

*2 

72 

/ 


( >, 



V72 

/ 

>w/ 

*3 

73 

/ 

>73 

>^ 2 

/ 

*4 

7 4 ' 

/ 



X 

7 

A7 


A 3 y A> A 5 y 


0 0 

1 

1 1 14 

15 36 

2 16 50 24 ( b) 

65 60 0 

3 81 110 24 

175 84 

4 256 194 

369 

5 625 

Figure 2-2 Finite difference tables, (a) Difference table definition. ( b ) Numerical example 


where z is the shifting operator defined by the relation 

z[y(z)]=y(x+kx) 

Furthermore, by repeated application of the shifting operator we see that 

z"[y(x)]=y(x + nAx) 


Equation 2-1 is derived by noting that the forward difference and shifting 
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operators are related as follows: 

4 v, = y, + i-yi- zy, -y, 

A"y, = (z-\ )"y, 

Note also that equation 2-1 can be written in the form 

A n y i = [z n -C(n t \)z n ' l + C(n,2)z n ~ 2 - ]y t 

^"yi=yi+n-C(n,\)y, + „-x + C(n,2)y t+n _ 2 - 

where 

C{n,m) = -— -- 

m ! (n — m)\ 

which is the mth binomial coefficient of order n. 


2-3 DATA INTERPOLATION 

Armed with these definitions, we are now prepared to examine a number 
of formulas for analytic substitution or for interpolation. The method that 
we use here involves a Lozenge diagram of differences and binomial 
coefficients which can be combined into interpolation formulas. The 
diagram is shown in Figure 2-3. Certain rules applied along paths across 
the diagram proceeding from left to right define interpolation formulas. 
This diagram is so general that it encompasses both Newton’s forward and 
backward difference formulas, Stirling’s interpolation formula, Bessel’s 
interpolation formula, and an interesting and unusual formula due to 
Gauss which zigzags across the diagram. The rules to be followed that 
generate these and many more interpolation formulas are the following: 

1. When moving from left to right across the diagram, sum at each step. 

2. When moving from right to left across the diagram, subtract at each 
step. 

3. If the slope of the step is positive, the term in the interpolation 
formula for that step is the product of the difference crossed times the 
factor immediately below it. 

4. If the slope of the step is negative, the term is the product of the 
difference crossed times the factor immediately above it. 

5. If the step is horizontal and passes through a difference, the term is 
the product of the difference times the average of the factors above and 
below it. 
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6. If the step is horizontal and passes through a factor, the term is the 
product of the factor times the average of the differences above and below 
it. 


-3 


- 1 


1 Ay ( — 4) s 

t(-3^C(« + 3,1) 


C(n + 4,2) 

>, 2 (-4) ' 

C(« + 3,2)^ A 3 7(-4) ; 



C(n — 3,2) 


Figure 2-3 The Lozenge Diagram. 


CO+ 5,4) 
A?(-5) 
CO+ 4,4) 

, A ,V 4 ) 

CO + 3,4) 
A, 4 (-3) 
CO+ 2,4) 
A 4 ( — 2) 
CO + 1,4) 

AM-0 

CO,4) 
AV(0) 
CO-1,4) 
A'V(l) 

CO-2,4) 


Following these rules, starting at^(O) and going down and to the right, 
we generate the interpolation formula 

y(n) =/(0) + C(n, l)Ay (0) + C(«,2)A 2 y(0) + • • ■ 

which becomes 


y( n ) = y(fy + w Av (o) + ——-AM°) + • • • 

This is Newton’s forward difference interpolation formula. To generate 
Newton’s backward difference formula, the procedure is reversed. Starting 
at^(0) and moving up and to the right, we generate the formula 


+ («)=t(0)+ C(n. l)Ay(~ 0 + C(n+ l,2)A 2 v(-2) + • • • 
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which becomes 

n(n+ 1) 

y(n)=y(0) + nAy( - 1)+---A 2 v(-2)+ • • • 

This is Newton’s backward difference formula. 

To develop Stirling’s formula, we start aty(O) and move horizontally to 
the right. In this case, we generate the interpolation formula 

f Ay (0) + Z 

y{n)=y(0)+C(n, 1)|-^ 

C(n + 1,2) + C(n,2) 

2 

,, ^ f A^(0) + Ay(-1) ] , n\ X/ n , 

y(n)=y(0) + n ! - 2 - + Y A 7(- 1) + • • ■ 

Bessel’s formula can be generated by starting midway between y(0) and 

T(l). 






+ 


y(n) = 


f +(0)+y(l) ) 


[ 


‘ 


+ (n-i)Ay(0) 


+ »(»-!) | AV(- 1) + A^ (0) ] 


l + - 


Clearly, a great number of other formulas can be generated and used for 
interpolation of data. 

Interpolation is often employed in computing intermediate values of 
tabulated functions. While the scientific pocket calculator gives sine, 
cosine, tangent, arc sine, arc cosine, and arc tangent (and, for some of the 
more advanced scientific machines, hyperbolic sine, hyperbolic cosine, and 
hyperbolic tangent), they usually do not have the capability of generating 
Bessel’s functions, Legendre polynomials, error functions, and the like. 
Those are often more easily evaluated with standard reference tables. In 
these cases, it is occasionally necessary to interpolate between two values 
in the table. 

Before discussing the interpolation process, however, it is worth pointing 
out that most well-made tables are often generated with auxiliary functions 
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as opposed to the actual functions themselves. For example, the exponen¬ 
tial integral with positive argument is given by 


Ei 




u 


which takes the series form 


Ei(x) = y + ln(x) + 


■ + 


+ 


M! 2-2! 3-3! 


+ 


can be approximated with the series 


Ei(x) = — 
x 


n 2 f 3' 

l + -^ + 4 + 4 + 

X X 2 X 3 


(x—> oo) 


The logarithmic singularity in the first series does not permit easy interpo¬ 
lation near x —0. The function Ei(x) — lnx is better behaved and more 
readily interpolated when x is near zero. In fact, x _1 [Ei(x)~ ln(x)— y] 
(where y is Euler’s constant 0.577- • •) is an auxiliary function that results 
in a slightly higher interpolation accuracy than when Ei{x) is computed 
from interpolated values of the table of Ei{x) directly. 

Generally tables are constructed and presented so that reasonable-order 
interpolating polynomials (i.e., first-, second-, or third-order) can be used 
to compute intermediate values while retaining the precision of the table. 

For example, in the Handbook of Mathematical Functions (U.S. Depart¬ 
ment of Commerce, Bureau of Standards, Applied Mathematics Series 55) 
most tables are accompanied by a statement of the maximum error in a 
linear interpolation between any two numbers in the table, and the number 
of function values needed in Laplace’s formula or Atkins’ method to 
interpolate to nearly full tabular accuracy. 

An example from the Handbook of Mathematical Functions appears in 
Table 2-1. The accuracy statement is given in brackets. The numbers in 
brackets mean that the maximum error in a linear interpolate is 3x 10“ 6 
and that to interpolate to the full tabular accuracy, five points must be 
used in Lagrange’s method or Atkins’ method of interpolation. The linear 
interpolation formula is 


f P =0-p)fo+pf> 
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Table 2-1 Exponential Integral Auxiliary Function 


X 

xe x E l (x) 

X 

xe x E } (x) 

7.5 

0.892687854 

8.0 

0.898237113 

7.6 

0.893846312 

8.1 

0.899277888 

7.7 

0.894979666 

8.2 

0.900297306 

7.8 

0.896088737 

8.3 

0.901296023 

7.9 

0.897174302 

8.4 

0.902274695 


(-6)3 

5 


where f 0 J Y are consecutive tabular values of the function corresponding to 
arguments x 0 ,x, respectively; p is the given fraction of the argument 
interval 


(*i“*o) 


and f is the required interpolate. For example, if we interpolate between 
the values of Table 2-1 for x = 7.9527, we find that 

/ 0 = 0.897174302 

/, =0.898237113 

p = 0.527 

We then obtain 

fo. 52 i = ( 1 “ 0.527)(0.897174302) + 0.527(0.898237113) 

/ 0 .527 = 0 . 897734403 . 

The terms in the brackets indicated that the accuracy for linear interpo¬ 
lation was 3x 10 -6 . Thus we round this result to 0.89773. The maximum 
possible error in this answer is composed of the error committed by the last 
rounding, that is, 0.4403 X 10~ 5 + 3 X 10 -6 , and thus certainly cannot ex¬ 
ceed 0.8 X 10~ 5 . 

To get greater precision, we can interpolate this example of the table 
using Lagrange’s formula. In this example, the interpolation formula is the 
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five-point formula: 


f(x 0 + pkx) = 


(p 2 - \ ){p-2)p 

24 


7 - 2 - 


, , (p 2 -1)(P"2)p ) r 

+ { /I (JO' 


(p~\)(p 2 -4)p 


(p+\)(p 2 --4)p \ 


fi 


+ 


( P 2 - 00 + 2)/? 

24 


4 01 <1 


Another approach is to use a five-term Newton forward or backward 
difference formula, a Bessel’s formula, Stirling’s formula, or any of the 
formulas that come out of the Lozenge diagram. The details associated 
with such interpolations are conveniently found in Chapter 25 of the 
Handbook of Mathematical Functions. 

Since there are occasions for using inverse interpolation, we discuss it 
briefly here. If we are given a table of values of the dependent variable y n 
as a function of values of the independent variable x n . 


y n = f(x n ) (tabulated function) 


then intermediate values of y can be computed by interpolating between 
the values y n with an interpolating polynomial g(?c) as 

y = g(.x) =f(x) (continuous function) 

Inverse interpolation is a matter of viewpoint. Here we would view the 
interpolation from the standpoint of the dependent variable, 

x n = f~\y n ) (tabulated function) 

Then intermediate values of x can be computed by interpolating between 
the values of x n with an interpolating polynomial h(y) as 

x = h(y)=f x (y) (continuous function) 

With linear interpolation there is no difference in principle between 
direct and inverse interpolation. In cases where the linear formula is not 
sufficiently accurate, two methods are available for accuracy improvement. 
The first is to interpolate more accurately by using, for example, a 
higher-order Lagrange’s formula or an equivalent higher-order polynomial 
method. The second is to prepare a new table with a smaller interval in the 
neighborhood of interest, and then apply accurate inverse linear interpola¬ 
tion to the subtabulated values. 
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It is important to realize that the accuracy of inverse interpolation may 
be very different from that of a direct interpolation. This is particularly 
true in regions where the function is slowly varying, such as near flat 
maximum or minimum. The maximum absolute error Resulting from in¬ 
verse interpolation can be estimated with the aid of the formula 

fa= (S) a >” s - v 

where <$y is the maximum possible error in the tabulation of y values 
and A/ and Ax are the first differences generated from the table in the 
neighborhood of the region of interest. 

Let us now return our attention to the generation of difference tables. In 
the generation of interpolating polynomials of reasonable size, the finite 
differences in the difference table must be small for high-order differences. 
If they are not small the questions is, “What can we do to reduce the size 
of the finite differences?” 

There are only three considerations associated with any difference table. 
The first is the number of differences to which the table is taken the 
second is the spacing between different values of the tabulated function, 
and the third is the number of figures tabulated. The effect of halving 
(factor of spacing in the independent variable x is to divide the first 
differences by 2, the second differences by 4, the third differences by 8, 
and so on. Examples of the effect that different spacings of x have on the 
function y = x 3 appear in Table 2-2. In answer to the question above then: 
to reduce the nth-order difference by a factor k we must reduce the data 
interval (independent variable) by a factor of approximately "vT . 


2-4 DATA EXTRAPOLATION 

Extrapolation outside the range of data that makes up a difference table is 
a controversial procedure. It is, however, a procedure of great practical 
interest. Given the behavior of a dynamic process, sampled at intervals, it 
is only natural to ask to what extent the table can be extended beyond the 
range of the data used to make up the table to predict the future behavior 
of the process being considered. This is a very practical, important, and 
real matter. It is the problem of science to be interested in predicting the 
behavior of systems based on observations of their past behavior. While 
there are a number of stock market “chartsmen” who use finite difference 
techniques, it is generally accepted that extrapolation outside the range of 
the difference table is as much an art as a science. Because of its practical 
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Table 2-2 The Effect of Interval Halving on The Finite 
Differences in a Difference Table 0 



Full interval Ax = 

2 

Half-interval Ax = 1 


X 

y = x 3 

A y 

AV 

A 3 y 

x y 

= x 3 

Ay 

ft 

< 

0 

0 

8 



0 

0 

1 



2 

8 

56 

48 

48 

1 

1 

7 

6 

6 

4 

64 

152 

96 

48 

2 

8 

19 

12 

6 

6 

216 

296 

144 

48 

3 

27 

37 

18 

6 

8 

512 

488 

192 


4 

64 

61 

24 


10 

1000 




5 

125 





°Third-order difference is reduced by a factor of 8 when 
interval between values of x is halved. 


value and practical interest, it will be covered here but with the proviso 
that the reader recognize that extrapolation is a questionable procedure. 
That is, the same difference table using only slightly different extrapolation 
techniques can, and usually does, lead to significantly different predictions. 
Because of this lack of robustness of extrapolated data, the procedure has 
questionable value. 

We illustrate the problem of prediction with the following practical 
example. Consider an aircraft executing a fully automatic landing. 
Sampled values of the altitude are shown in Table 2-3. What will be the 
conditions at touchdown? This particular example is a nontrivial one, in 
that the heart of present-day flight-control performance monitors hinges 
on the ability to predict the dynamic behavior of high-energy devices, such 
as aircraft, when terminal operations are under automatic control. The 
obvious first step is to form the difference table, as shown in Table 2-3. We 
note that this table can be carried to the third difference without the 
lower-order differences becoming constant. The obvious next step to 
extrapolation is to assume that the third difference holds constant up to 
touchdown, and to predict the behavior shown in Table 2-4. Clearly the 
predicted results are fairly grim. We have low confidence in extrapolations 
of this type because the difference table did not indicate the influence of 
any control law through arriving at constant differences between any of 
the finite differences. Had we found, for example, that all of the second 
differences held constant and the third differences were zero, we might be 
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Table 2-3 Difference Table of Altitude of an 
Aircraft Executing an Automatic Landing 


t 

MO 

A h 

A 2 h 

A 3 h 

0 

60 

-13 



1 

47 

-10 

+ 3 

-2 

2 

37 

-9 

+ 1 

0 

3 

28 

-8 

+ 1 


4 

20 





entitled to higher confidence in the extrapolation to touchdown by assum¬ 
ing that the guidance law objective was to hold the third-order differences 
to zero. It follows, then, that a procedure for increasing the confidence in 
extrapolation from finite difference tables is to find a transformation of the 
variable of interest which would expose the guidance law and its effect on 
the difference table. 


Table 2-4 Extrapolated Touchdown Conditions of Automatically Landed Aircraft 

t h(t ) A/i A 2 h A \ 


/ 0 


\ 1 

Range of I 2 


actual data \ 


/ 3 


\ 4 


( 5 


Range of < 6 

Touchdown 

extrapolation i 

sink rate 

V 7 

~11 fps* 


(hard) 


60 

-13 

47 +3 

-10 - 2 ) 

37 +1 > Average = -1 

-9 0) I 



3 -2 

-11 


8 


fps = feet per second. 
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After a little thought, it might be expected that the guidance law to 
automatically land an aircraft would be an exponential law of the form 

4j± = -k(h + h„) 

which results in a flared landing path of the form 

h = ho e ~ k ‘-h B 

which is sketched in Figure 2-4. This suggests the formation of the 
difference Table 2-5. Note that the second difference is near zero so that 
for short-term prediction (next 7 seconds) a reasonable assumption is that 
Ain (h) is approximately constant and equal to -0.3062. The differences 
between the two approaches are tabulated in Table 2-6. It is apparent that 
the logarithmic extrapolation does better short-term prediction than does 
the“third-order” extrapolation. 



0 600 1200 1800 2400 3000 (feet) 

Figure 2-4 Typical jet transport landing trajectory (fps = feet per second). 


In summary, we might expect to extrapolate with greater confidence 
from Table 2-5 than Table 2-4 because of the observed characteristics of 
the guidance law. From a strictly mathematical viewpoint, the issue is not 
all that clear. The number of actual samples of the second difference is 
small and thus the true mathematical confidence in the fact that the 
guidance law is in some way holding the second difference to zero is low. 
In the end, extrapolation using difference tables involves careful judgment. 




Table 2-5 Logarithmic Extrapolation of Touchdown Conditions of 
Automatically LandedAircraft 



t 


h{t) 


A h 




I ° 


60 

4.0943 

-0.4054 

N 



V 1 


47 

3.8501 


j 


Actual 

data 

( 2 


37 

3.6109 

-0.2442 

f 

/ Averages 

0.3062 

range 

f 3 


28 

3.3322 

-0.2392 








-0.3365 

) 



\ 4 


20 

2.9957 

- 0.3062 




/ 5 


14.7 

2.6895 

-0.3062 




6 


10.84 

2.3833 

-0.3062 



Range 
of ex- , 

l 7 


7.98 

2.0771 

-0.3062 



trapo- ^ 

8 


5.876 

1.7709 




lation 

1 9 ) 

Sink 

4.326 

1.4647 

-0.3062 



I 

\ 

( 

■ rate 



-0.3062 



\ 

10 ) 

\ 

1 —1.141 fps a 

3.185 

1.1585 

-0.3062 




11 


2.345 

0.8523 




a fps = feet per 

second. 






Table 2-6 

Comparison of Extrapolation Methods for Predicting 


Touchdown Conditions 












First-Order Logarithmic 

Actual 

“Third-Order” 

Extrapolation 

Extrapolation 





Absolute 


Absolute 

t 


h 

h 

Error 

h 

Error 

5 


16 

12 

- 

-4 

15 

-1 

6 


11 

3 

- 

-8 

11 

0 

7 


7 

-8 

- 

15 

8 

+ 1 

8 


4 




6 

+ 2 

9 


1.6 




4 

+ 2.6 

10 


0 




3 

+ 3 

11 


0 




2 

+ 2 


a fps = feet per second. 
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2-5 DATA ERROR LOCATION AND CORRECTION 

Errors due to observations, calculation, measurement, or recording often 
occur in a table of numbers. These errors introduced into the calculation 
process are significantly magnified in the generation of ascending 
differences in the difference table. This can be seen in Table 2-7. It is 
apparent that the errors propagate and are distributed binomially (in any 
given difference the errors are weighted by binomial coefficients). It is also 
apparent that the error grows rapidly as it propagates into ascending 
orders of difference. For example, the error in Table 2-8 might be antici¬ 
pated by noting the form of the third difference. We see the pattern of 
signs ( + ), ( —), ( + ), ( —) indicative of error propagation. Also, note that 
the pattern of fourth difference is centered ony=17. Furthermore, note 


Table 2-7 Error Propagation in Difference 
Tables 


y A y h?y A 3 y A 4 y 


0 

0 


0 


0 




0 


0 


0 


0 


+ € 


0 


+ € 


0 


+ € 


-4c 


+ € 


3 € 


T € 


-2c 


+ 6e 


— € 


+ 3e 


0 


+ e 


_ 4<e 


0 


— e 


0 


0 


+ e 


0 


0 


0 


0 




0 

0 
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Table 2-8 Unit Error Propagation in the Difference 
Table for the Function y = x 2 


X 

y 

Ay 

A V 

<t 


0 

0 

1 




1 

l 

1 

3 

2 

0 


2 

4 

5 

2 



3 

9 

^8- 


-3 

-4 

4 

17<; 


0 


+ 6 

5 

25 

8- 


+ 3 

-4 



11 




6 

36 


2 


""+1 



13 


0 


7 

49 

15 

2 



8 

64 






that 6e in Table 2-7 corresponds to 6 in Table 2-8; that is, 


6c = 6 


e = 1 


Moreover, if the error in the values of y were of the form 

y = x 1 2 + 5 


we can expect in the fourth difference column to show an error of 6k. 
Thus one-sixth of the fourth difference which is centered on the number in 
error is a measure of the error—which can then be subtracted from the 
column of y values. We might modify Table 2-8 by replacing 17 with 
(17-1)= 16, thus obtaining the difference table shown in Table 2-9. In 
general, then, data smoothing is done by: 

1. Keeping an eye open for the ( + ), (-), ( + ), ( — ),*•• pattern in 
high-order differences that indicates error propagation. 

2. Identifying the tabulated value on which the pattern is centered. 
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Table 2-9 Smoothed Data Table for the Function 


X 

y 

Ay 


A 3 y 

A *y 

0 

0 

1 




1 

i 

3 

2 

0 


2 

4 

5 

2 

0 

0 

3 

9 

7 

2 

0 

0 

4 

16 

9 

2 



5 

25 






3. Equating observed error with its binomial error counterpart. 

4. Solving for the error and appropriately modifying the data table. 

5. Testing the table for elimination of the ( + ),( — ),( + ), ( — ), ...pattern. 

2-6 MISSING ENTRIES 

Occasionally a difference table has a few missing entries in the dependent 
variable. Missing entries in the difference table can be estimated in several 
ways. 

The simplest method is to examine the table and decide whether the 
points could be reasonably fit with a polynomial. For example, a data 
table with four points, one of which is unknown, might be fit with a 
second-degree polynomial. It is characteristic of difference tables that 
nth-order differences of polynomials of degree n— 1 equal zero. For 
example, the equation 

y = 2jc 2 + jt + 3 

has the difference equation shown in Table 2-10, where it is apparent that 
the third-order differences equal zero. This characteristic is present in 
general in nth-order polynomials; that is, their (n + l)st-order (and all 
higher) differences equal zero. Using this property, we would expect that 
the fourth-order difference would equal zero; that is 


A 4 (y) = 0 



Missing Entries 
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Table 2-10 Difference Table for y = lx 2 + at + 3 


Subscript 
in Missing 

Entry Formula 


y 

A y 

A V 



0 

3 

3 




1 

6 

7 

4 

0 

0 

2 

13 

11 

4 

0 

1 

3 

24 

15 

4 

0 

2 

4 

39 

19 

4 

0 

3 

5 

58 

23 

4 

0 

4 

6 

81 

27 

4 



7 

108 





This can be rewritten in the shifting-operator notation as 
(z- l) 4 y = (z 4 -4z 3 + 6z 1 2 3 —4z + l)y = 0 

This gives us 


y 4 ~ 4^3 + 6 T 2 - 4>h +^o = 0 


y2=i[*(y\+y3)-(y4+yo)] 

Here we use an even-order difference because all even-order difference 
equations do the following: 

1. Give one middle term, which can be centered in the missing number 
in the table. 

2. Result in missing entry determination with a minimum of roundoff 
error. 

3. Are numerically more stable than their odd-order counterparts. 

Let us assume, for the sake of the discussion, that the y — 39 entry is 
missing in Table 2-10. We can substitute directly from the table with the 
missing data point to obtain 

yi~ s [4(24+ 58)-(81 + 13)] 
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from which we can solve for the missing data point: 

_y 2 = ±[4(82) - 94] = £(328 - 94 = 234) = ^ = 39 

The method just described for filling in missing values in the data table 
is particularly suited to analysis on the pocket calculator in that it does not 
involve the determination of unknown coefficients in a polynomial (the 
usual methods for missing data determination). For tables with large 
numbers, the arithmetic could be tedious, but with the pocket calculator it 
is a simple matter to perform the sums and products for tables of large 
values requiring high precision. Another point worth making regarding 
identification of missing entries in data tables is that for tables with large 
numbers of values, say on the order of 20 to 100, it is not necessary to look 
for twentieth-order differences to develop the formula for computing the 
missing data. One need only determine the polynomial that can be rea¬ 
sonably expected to fit locally through two, four, or six data points 
symmetrically placed about the missing value to find the missing point. 

We have been stressing the determination of interpolating polynomials 
by way of finite difference tables because the pocket calculator enables 
one to find finite differences quickly and conveniently, thus leading 
immediately to interpolation formulas of high order and high accuracy, 
which themselves can be evaluated on the pocket calculator conveniently 
and to high precision. This, in fact, is the reason for using the pocket 
calculator with difference tables: high-order difference tables lead to 
high-order approximating polynomials, which, when written in nested 
parenthetical form, are easily evaluated on the pocket calculator to high 
precision. 

The difficulty in using low-order polynomials for manual analysis in the 
precalculator era was that they generally were not sufficiently accurate to 
permit the precision numerical evaluation necessary for most engineering, 
economic, chemical,and other types of precision analysis. On the pocket 
calculator we can conduct precision analysis relatively quickly and 
efficiently by using high-order polynomials generated simply with 
difference tables of high order. 


2-7 LAGRANGE’S INTERPOLATION FORMULA 

So far we have studied the interpolation of equally spaced data through the 
use of difference tables and the Lozenge diagram as a convenient means 
for remembering a large number of different interpolation formulas. These 
interpolation formulas, however, do not apply to nonequally spaced values 
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of the independent variable nor when the «th differences of the dependent 
variable are not small or zero. In these cases we can then use Lagrange’s 
interpolation formula to develop a polynomial that can be used for 
analytic substitution. Though there are other interpolation formulas for 
unequally spaced data, the advantage to using Lagrange’s interpolation 
formula is that the coefficients are particularly easy to remember, and to 
determine, with the pocket calculator. The method works for both nonequ- 
ally and equally spaced data and regardless of whether the /?th differences 
are small. Lagrange’s interpolation formula is 

(*-*i)(*-* 2 )‘••(■*-*/>) ^ (-y-* 0 )(*-* 2 )- ‘ ‘ (x-x p ) 

y y ° (*o“ x x )(x 0 ~ x 2 ) '"(x 0 -x p ) 1 (*1 - *o)(*1 - *2) * * • 0*1 - x p ) 

(x-x 0 ){x-x x )> • • (x-x : x ) 

+ . . . +y --- 

P (x p -x 0 )(x p -x l )'-(x p -x p _ l ) 

An interesting and important feature of Lagrange’s interpolation for¬ 
mula is that, if the data table has n entries, the formula appears to have n 
terms. It turns out, however, that if the table amounts to four or five 
samples of, say, a second-order polynomial, the terms will cancel, giving 
only the pieces due to the quadratic function. As an example of this, 
consider the data table shown in Table 2-11. Using Lagrange’s interpola¬ 
tion formula, we have 

(* — 5)(x — 7)(jc — 9 )(jc —11) t (*-3)(*-7)(*-9)(*-ll) 
(3-5)(3-7)(3 —9X3—11) (5-3)(5-7)(5-9)(5-11) 

(*-3)(x-5)(*-9)(*-‘ll) (x-3)(*-5)(*-7)(*-ll) 

+ 5S (7 —3)(7 —5)(7 —9)(7 — 11) (9 — 3)(9 — 5)(9 — 7)(9 —11) 

(*-3)(*-5)(*-7)(*9) 

(11 3)(11 5)(11 7)(11 9) 

The reader can now simplify the equation. It will be found that 

y = 2x 2 — lx + 9 

which is a polynomial of degree two, rather than of the fourth degree, as 
might be expected from the fourth-order polynomials in the numerators of 
all the terms in Lagrange’s interpolation formula. Because of roundoff the 
exact cancelation of the coefficients for the higher powers of x will not 
occur, but they will be very small, indicating that they should be made 
zero. 
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Table 2-11 Five Evaluations of a Quadratic 

Equation 

x 3 5 7 9 11 

y 6 24 58 108 174 


2-8 DIVIDED DIFFERENCE TABLES 

Another approach to the generation of interpolation formulas for tables of 
data of unequally spaced values of the independent variable is to prepare a 
table of divided differences. Assuming that the values of the independent 
variable * are x Qy x v x 2 ,x 3 • • • and that the value of the dependent variable 
is y —f(x), we can prepare a table of successive divided differences of the 
form 


/(*0>*l) 


*i-*o 


f(x 0 ,x v x 2 ) = 


f(x v x 2 )-f(x 0 ,Ai) 


X") An 


f(x Qy x ly x 2y x 3 ) = 


f(x ] ,x 2 ,x 3 )-f(x 0 ,x l ,x 2 ) 
x 3 -x 0 


These terms are commonly called divided differences of orders 1,2,3, 
and so on. We can now prepare a table of divided differences, as shown in 
Table 2-12, 


Table 2-12 Divided Difference Table 


*( 0 ) 

/(* o»*i) 

x(\) f(x 0 ,X l ,X 2 ) 

f(Xi,X 2 ) 

x(2) f(x lt x 2 ,x 3 ) 

f(x 2>X 3 ) 

x(3) f(x 2 ,X 3 ,X 4 ) 

f(x 3 ,x 4 ) 

x(4) 


f(x Qy x x ,x 2 ,x 3 ) 

f(x h x 2 ,x 3 ,x 4 ) 


f(x 0 ,x x ,x 2 ,x 3 ,x 4 ) 
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In the same way that the ( n+ l)st difference of an «th-order polynomial 
was zero, it is found that the /tth-order divided difference of an ^th-order 
polynomial is zero. Newton’s interpolation formula, based on divided 
differences, is of the form 

y=f(xo) + (x-x 0 )f(x 0 ,x l ) + (x-x 0 )(x-x l )f(x 0 ,x v x 2 } + ’ •' 

+ (x-x 0 )(x-x l )- • • (x-X„_ l )f{x 0 ,X l ,...,X n ) + e(x) 

where 


«(*)- 


f"\e) 

n\ 


n 


n (x-x k ), 

k — 0 


9 is between the largest and smallest ofx, x 0 ,xx n 


2-9 INVERSE INTERPOLATION 

We have been concerned with the interpolation to determine values of the 
dependent variable, given either equally spaced or unequally spaced values 
of the independent variable. Inverse interpolation involves finding values 
of the independent variable, given a table of values of the dependent 
variable. In particular, the method is useful for finding missing values of 
the independent variable in a tabulated set of data. A nice feature of 
numerical analyses using finite or divided difference tables is that inverse 
interpolation is performed in identically the same way in which interpola¬ 
tion is conducted. That is, the procedure is an interpolation process where 
the dependent and independent variables are switched. The values of x are 
to be determined and thus are ‘"dependent” on the values of y. Hence the 
interpolation problem is one of developing an interpolating polynomial 
through the sequence of values of the independent variables in the prob¬ 
lem. The procedure is then identical to regular interpolation. 


2-10 THIRTEEN-PLACE PRECISION FROM TWO-DIGIT TABLES 

An interesting aspect of the use of difference tables that is consistent with 
the high precision of pocket calculators is that specially prepared 
difference tables permit precision interpolation to the accuracy of the 
calculator’s capability, but with table entries of apparently only two or 
three significant digits. In fact, the accuracy is known to an infinite number 
of digits but only two significant digits are nonzero. For example, a table 
of y = x 2 can take either the form shown in Table 2-13 or that shown in 
Table 2-14. 



76 


Difference Tables, Data Analysis, and Function Evaluation 


Table 2-13 Difference Table for y = x 2 where Ax = 1, 
x 0 = T7<Two-Place Accuracy)" 


X 

y 

Ay 

A 2 )' 

A^ 

3.14 

9.87 

7.27 

2.01 


4.14 

17.14 

9.28 


-0.01 

5.14 

26.42 

11.28 

2.00 

0 

6.14 

37.70 

13.28 

2.00 

0 

7.14 

50.98 

15.28 

2.00 


8.14 

66.26 





“Interpolation formulas based on this difference table 
can only be accurate to two places after the decimal 
point at best, no matter how high the order of the 
interpolation formula, because the differences are only 
known to two places. 


Both difference tables are developed with integer differences in the 
dependent variable and precisely known values of the independent vari¬ 
able in Table 2-14 and two-place accuracy in Table 2-13. The difference 
between these two tables is that the first permits interpolation to an 
accuracy of only two places, while the second interpolation permits an 
accuracy to 10 places, even though both are based on entries in the table 
that are only known to a few significant figures. It is precisely in this 
manner that the scientific pocket calculator, and even the simple four- 
function pocket calculator, can be used to boot-strap itself to generate 
advanced mathematical functions to an extremely high precision. All that 
is required is that certain values of both the dependent and independent 
variables of the advanced mathematical function be known precisely— 
where only a small number of nonzero digits make up the number. These 
values can then be used in a high-order difference table to generate an 
interpolation formula that will be very accurate over the range of the data 
table. 
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Table 2-14 Difference Table for y = x 2 where 
Ax — 1, x 0 = 3 (oo Place Accuracy) 4 * 


X 

y 

Ay 


A 3 y 

3 

9 

7 



4 

16 

9 

2 

0 

5 

25 

11 

2 

0 

6 

36 

13 

2 

0 

7 

49 

15 

2 

0 

8 

64 

17 

2 


9 

81 





a Interpolation formulas based on this 
difference table can be as accurate as the 
order of the interpolation formula will allow 
because the differences are known precisely. 


Example 2-1 Using the definition 

A = z — 1 

write an expression that will interpolate between data points and 
differences in a data table. Since 

z = (1 + A) 


z ” = (1 + A)” 

then 


n{n — 1 ) 

z"[^(x)]=7(x + «Ax)=^(x) + «A>’(x)-1---A 2 >'(x)+ • • • A"k(x) 

Note that this is Newton’s forward difference interpolation formula. 

Example 2-2 Use a difference table to check an interpolating polynomial. 

We can check Newton’s or any other interpolation formula by substitut¬ 
ing data points from a known polynomial such as y — x 2 (see Table 2-15) 
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into the formula. Then/(A + «Ax) becomes 

^(04- n)=y(n) = 0 + n + n(n— 1 ) = n + n 2 — n=y(n) = n 2 

We see that the interpolation formula gives the original polynomial y(x ) 
— x 2 again: a result to be expected, since the interpolation formula is itself 
a polynomial. 


Table 2-115 Numerical Example for 
y — x 1 


X 

y 

Ay 

A * 

A 3 y 

0 

0 

1 



1 

1 

3 

2 

0 

2 

4 

5 

3 


3 

9 
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PART TWO 


NUMERICAL EVALUATION 
OF FUNCTIONS 

ON THE POCKET CALCULATOR 




CHAPTER 3 


ELEMENTARY ANALYSIS 
WITH THE POCKET CALCULATOR 


3-1 INTRODUCTION 

A number of analytical topics used in elementary analysis are discussed 
here. Among them commonly used progressions including arithmetic, 
geometric, harmonic, and concepts of generalized means; the detailed 
definitions of absolute and relative error; nested parenthetical forms of 
commonly used infinite series including Taylor’s series; certain often- 
encountered forms of the binomial series; the reversion of series; and 
methods for transforming series that converge slowly into series that 
converge more quickly. Also discussed are methods for evaluating the 
roots of polynomials including quadratics, cubics, quartics, and quintics; 
methods for the numerical evaluation of transcendental functions and for 
solving plane and spherical triangles; and methods for numerically evaluat¬ 
ing commonly encountered functions of complex variables. The formulas 
and equations used for pocket calculator analysis are written in forms most 
convenient for evaluation on the pocket calculator. 


3-2 NUMERICAL EVALUATION OF PROGRESSIONS 

An arithmetic progression is defined by a sequence of numbers 
a n = a x + (n — \)d, (n an integer >0) 
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where a and d are real numbers. For = 3e and d= —m 

_ n _ _ a n 

1 8.154845484 

2 5.013252830 

3 1.871660176 

4 - 1.269932478 

5 -4.411525132 

A common problem is to compute the sum of the arithmetic progression to 
n terms: 


S n (d) = a + (a +d) + (a + 2d)+ • • • + [a + (n - \ )d] 

There are two formulas for computing the sum of an arithmetic progres¬ 
sion. The first is 


S n (d) = na+ in(n- 1 )d 

which can be rewritten in nested parenthetical form for easy evaluation on 
the pocket calculator as 


= («-!)) 

Another formula for computing the sum of the arithmetic progression to n 
terms is 


S„(rf)=f(a + 0 

Here the last term in the series l is 

l—a + (n — \)d 

We note that this equation is already in a form that can be easily evaluated 
on the pocket calculator. 

The geometric progression is defined by a sequence of terms of the form 
a n = a l r n ~\ 


(n an integer >0) 
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where a and r are real numbers. For a x = 3e and r- — tt 

_ a _n _ 

1 8.154845484 

2 —2.561920267 X 10 1 

3 + 8.048509891 X 10 1 

4 -2.528513955 X10 2 

5 + 7.943560867 X 10 2 

The sum of the geometric progression to n terms is 

S n = a \ + a x r + a x r 2 + a x r 3 + a x r 4 + • • • +a { r n ~ l 

It can be computed with the formula 

«,(1-r") a x -rl 
hn 1 - r 1 — r 

where / is the last term. If r< 1 in size, then as n-> oo 

a, 

lim (S n )= - - 

n->oc V n} 1 - r 

since the last term /->0.The sum of the geometric progression to n terms 
requires scratch-pad or memory storage. Table 3-1 shows a typical key 
stroke sequence needed for its evaluation and the required storage. 

Three types of means are encountered in advanced analysis—the arith¬ 
metic mean, the geometric mean, and the harmonic mean. Though they are 
all special cases of the generalized mean 

we are explicit here and write them out. The arithmetic mean of n 
quantities is defined by the equation 

a l + a 2 + a 3 + ••• + a n 


which can be computed conveniently (though not so easily as summing 
and dividing by the total number of samples) on the pocket calculator 



84 


Elementary Analysis with the Pocket Calculator 


Table 3-1 Typical Key Stroke Sequences for Evaluating 
the Sum of Terms in a Geometric Progression 


Algebraic 



Reverse-Polish 

(r) r 


(r) 

(1.0) 

y x / 

(1.0) 

t 

+ 

(n) 

1/* 

(«) 

- 5 - 

CHS / 

X 

RCL 

/ 

CHS 

1 aco 

+ / 

— 

(1.0) 

Note : Recall is 

(1.0) / 

1 5 1 

+ 

automatic in 



X 

Reverse-Polish 

STO / 


(r) 

for stack memory 

(/■) 


CHS 

(HP-35&21) but not 

CHS^ 



for register 

memory (HP-45,55&65) 


( )-*data entry, 
□—►output. 


using a recursion formula 

A r,*\=~^r[( nA n + a„ + ,) 

which can be developed from the equation for A as follows: 



n + 1 


y.a, 


A n +1 

1 = 


+ 


' =A„ + 


u n+\ 


n 

n 


A -+-^ a n+i=-ArT [”A n + a n+l ] 


n+\ n n +1 


n + 1 


An advantage to using a recursive “averager” is that the analyst can 
observe the convergence of the mean as he adds more terms to the 
calculation. He can thus often reduce the workload in computing an 
average by using only the numbers that are necessary to estimate the mean 
to the accuracy he desires. 

The recursive form is directly implementable, using the key strokes 
shown in Table 3-2. 
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Table 3-2 Typical Key Stroke Sequences for Recursive Arithmatic 
Averaging 


Algebraic 


Reverse-Polish 



( )—> data entry. 

□—> output 

o- mental step done by analyst. 

* Initial conditions can be A n = 0 when n= 0. 


The geometric mean of n quantities is defined by the relationship 

G = (a x a 2 • • • a n )' /n , (^>0, /' = 1,2,... ,n) 

which is easily calculated using the recursion formula for the geometric 
mean of n quantities as given by 


c„ +l =(a„ +l c ;) i/n+1 


and is developed as follows: 


n , \/n 


c„=(iu) 
c;=n«, 

G„"+/=(no, )(«„+,) 


n . \/n 


G; + + , 1/ ' , = («„ +] ) ,/ "(n«,) =ali\G„ 

G„ +l = al{\ + 'G:'"+' = (a n G" n ) Wn+ ' 
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A typical sequence of key strokes for evaluating this equation appears in 
Table 3-3. 


Table 3-3 Typical! Key Stroke Sequence for Recursive 
Evaluation of the Geometric Mean 


Algebraic 


Reverse-Polish 


G„ G n 



( )-^ data input. 

Q—» output. 

o —» mental step by analyst. 


The harmonic mean of n quantities is defined by 


1 = 1 
H n 


± + i. + 

a, a. 


+ 



( a f ->0 ,/ = 1 , 2 ,...,«) 


It, too, can be evaluated by using a recursion formula: 




w+1 


+ 


H„ 


- i 


The harmonic mean is evaluated using the typical key stroke sequence 
given in Table 3-4. 

Finally, the generalized mean is related to the geometric, arithmetic, and 
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Table 3-4 Typical Key Stroke Sequence for Recursive 
Evaluation of the Harmonic Mean 


Algebraic 


Reverse-Polish 


H n H n 



( )-^ data input. 

□—» output. 

mental step by analyst. 


harmonic means according to the relations 

lim M(t) — G 
/-> o 

M{\) = A 
M(— \) = H 


3-3 THE DEFINITION OF ABSOLUTE AND RELATIVE ERROR 

We discussed absolute and relative errors previously in the context of other 
matters. In the next chapter a number of errors are quoted; hence it is 
important to define precisely what is meant by absolute and relative errors. 
When x 0 is an approximation to the true value of x, we say the following: 

1. The absolute error of x 0 is hx — x 0 —x = (calculated — true). 
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2. The relative error of x 0 is Sx = Ax/x, (calculated — true)/true, which is 
approximately equal to Ax/x 0 . 

3. The percentage error is 100 times the relative error. 

If in (2) we use the approximation of the true value of x to estimate 
percentage error then in a sense there is a small error in estimating the 
relative error. 

The absolute error of the sum or difference of several numbers is at most 
equal to the sum of the absolute errors of the individual numbers. If it can 
be assumed that the errors occur in a random independent fashion, a more 
reasonable estimate of the error in computing the sum or difference of 
several numbers is root-sum-square error defined as 

/ \'/ 2 

(2 A*, 2 ) 

The relative error of the product or quotient of several factors is at most 
equal to the sum of the relative errors of the individual factors. Finally, if 
y = /(x), the relative error 


If we have 


Sy = ^L s l^ Ax 

y /(*) 


y=f(x„x 2 ,...,x„) 

and the absolute error in x,- is Ax, for all n , then the absolute error in / is 


A/s 


9/ A 

■aZ A *' + 


Bx-> 


Ax? + • • • + 


_9 / 

9x„ 


Ax„ 


Simple rules, similar to those for the relative error of a product or the 
quotient, can easily be derived for relative errors of powers and roots. It 
turns out that the relative error of an nth power is almost exactly n times 
the relative error of the base power, while the relative error of an nth root 
is 1 /nth of the relative error of the radicand. 


Calculations with Approximate Values 

Where they are developed from test experiments or from tables of 
characteristics of physical systems, data are usually inaccurate to some 
degree. In general, calculations made with data based on measurements 
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involve errors of some magnitude. Another type of error in calculating 
with approximate values is due to the use of numerical values of numbers 
that are truncated, producing roundoff errors. The maximum errors 
associated with these effects can be estimated. When the rounding is done 
correctly, the roundoff is at most one-half of the unit in the last place 
retained in the number. 

When the numbers are rounded, the addition of zeros after the last digit 
of the decimal fraction makes a difference. The number 0.98700 is stated 
with 100 times greater accuracy than 0.987. In the first case the number 
implies that at most its error is 5x 10 -6 . In the second case, the error can 
be as large as 5xl0~ 4 . The implications of accuracy should be stated 
precisely when tabulating results computed on the pocket calculator. 

The error due to a calculation that results from an inaccuracy of the 
data is known historically as “error of data.” The error introduced into the 
calculation by way of approximation associated with the limitations of the 
machine or field of numbers being used in the calculation is historically 
called “error of calculation.” It is the objective of any calculation to make 
the error of calculation significantly less than the error of data. For¬ 
tunately, for most pocket calculators the size of the numbers that can be 
contained is so large that the error of engineering calculation is almost 
always substantially smaller than the error of engineering data. 

When good computing practice is followed, care must be taken when 
computing the difference between nearby numbers. Since occasionally the 
magnitude of the error of calculation is found to determine the method of 
the calculations to be done, we are interested in estimating from the error 
of data the maximum error to be expected in the result of the calculation 
due to this error in the data. It is for these reasons that we give the rules 
for computing the absolute errors of the sums, differences, products, and 
quotients of numerical calculations. These formulas can be used to answer 
the questions about the size of the error-of-data from which can be 
determined whether the error-of-calculation will be on the same order of 
magnitude or smaller. Furthermore, if the error-of-calculation is less, it can 
be used to guide the analyst in how much the error-of-data will limit the 
accuracy of a calculation. This will indicate the accuracy remaining after a 
complex or involved calculation. 

The results of a calculation are the most inaccurate when the difference 
of two nearly equal and only approximately known numbers are involved. 
To determine the relative error in these cases, the sum of the absolute 
errors, taken without regard to sign, is divided by the difference of the two 
numbers involved (a small number that can turn even a small absolute 
error into a large relative error). 
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3-4 INFINITE SERIES 

We will have many occasions to use Taylor’s formula for a single variable 
as given by the expression 

f(x + h)-f(x) + hf'(x) +^/'(*)+-+ 

This equation has an error formula that can be written in three typical 
forms: 


R„=^r(x+o,h), (o<0,<i) 

7(l-f> 2 r'f(*+M), (0 < 0 2 < 1) 

(« — 1 )! 

Rn= i~^~K\ \'(\~ t )"~'r( x+th ) ctt 

( n - 1)! A) 

The truncated version of the series can be expanded in nested parenthe¬ 
tical forms for convenient numerical evaluations when the numerical 
values of the derivative either are given or can be quickly computed. 

fi =/(*) 


/ 5 =/W^(/' + f(/" + f(/'" + ^f))) 


/„=/«+*!/'+f r+f 




/""+■ 


n — 1 \ n 
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Taylor series expansions of /(x) on the point a are given by the 
expression 


(x-a) (x-j) 

f( x )=f(a) + (x-a)Aa)+^ r - L f"(a)+--- + 


n - 1 


(«— 1)! 


-r-'(a)+R n 


where the remainder formula is given by 


(x - a) 

This expression, too, can be written in nested parenthetical form as 

/ (x — a) I (x — a) ( 

In =/(«) + (*-*)( /'(«) + ——(/"(«) + —3— I /"(*) + * * * 


+ 


(x-a) 
n- 1 




(x-a) 

n 



Binomial Series 

The binomial series is encountered many times in combinatorial analysis 
as well as in the formulation of difference equations for numerical analysis. 
The binomial series can be written in the general form 

0 + x) a = 2 | (— 1 < x < 1) 


where 



a! 

(a — k)\k\ 


Particular series of interest are 


a a(a — 1) a(a — l)(a-2) 

(1 + x) = 1 + ax H —7 — -x 2 H -^ r --x 3 + 


2! 


3! 


which can be written in nested parenthetical form for easy pocket calcula- 
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tor evaluation as 


(1 + x) = 1 + ax 1 + 


x(ot- 1 ) / 1 + x(a-2) 


x(a — 3) 
1+— : -- + 


+ 


x(a — n+ 1 ) / x(a — n) 


n- 1 


1 + 


Other frequently encountered binomial series are the following: 


(a) 


(1+x) 1 = \- x + x 2 -x 2 +X 4 -x 5 + • • • , 

• (1 + x) *= 1 -*(l-x(l - x (1 -x(\-x(- • • ,))))) 


(c) 

(d) 

(e) 


(l + x) V 2 = i + |_^ + ^_^ + 7g_2lg 

•••(i+-) ,/2 =i + f(i-f( 1 -f( 1 -y-( 1 '-.))) 

+ 


(W<1) 

(W<1) 

(W<I) 

(W<1) 

(W<1) 

(W<1) 

(W<1) 


Operations with Truncated Forms of Infinite Series 

An integral part of advanced analysis on the pocket calculator is the 
numerical evaluation of truncated series. Generally, the approach is to 
truncate the series at something on the order of four terms and use the 
series to evaluate the function over the region that has a good fit with the 
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function being considered. Once a series is generated, whether with Che- 
byshev polynomials, Taylor series, the binomial series, Legendre 
polynomials, or some other means, such operations can be performed on 
the series as inverting the series, taking the square root of it, squaring it, 
multiplying or dividing it, taking the exponential of it, or taking the 
logarithm of it. This is conveniently done by manipulating the coefficients 
in the series. These operations are tabulated in Table 3-5 for the three 
series 


s l = 1 + a x x + a 2 x 2 + a 3 x 3 + a 4 x 4 + • • • 


s 2 = 1 4- b x x + b 2 x 2 + b 3 x 3 + b 4 x 4 + • • • 


s 3 — 1 + c,x + c 2 x 2 + c 3 x 3 + c 4 x 4 + • • • 

Among convenient series manipulations is the reversion of series, where 
the dependent variable is solved in terms of the independent variable. 
Given the series 


y = ax + bx 2 + cx 3 + dx 4 + ex 5 +fx 6 

we can write x as a function of y as 

x &Ay + By 2 + Cy 3 + Dy 4 + Ey 5 + Fy 6 

where 


A = - 
a 


*-4 

a 


C — 


2b 2 — ac 


D = 


5 abc — a 2 d— 5 b 3 


E = 


6a 2 bd+ 3<2 2 c 2 T \4b 4 -2\ab 2 c — a 3 e 


F = 


7a 3 be + la 3 cd+ 84 ab 3 c - a 4 f- 28 a 2 bc 2 - 42 b 5 - 28 a 2 b 2 d 



Table 3-5 Series Operations 


Operation 
j 3 = 5T 

S 3 = S 1 S 2 
J 3 = *l/ J 2 
s 3 = 

s 3 = 1 +ln(^,) 


C\ c 2 


c 3 


c 4 


na x \(n—\)c x a x + na 2 c x a 2 {n — 1)+ 1)(« — 2) + 

na-> 


a x + b x b 2 + a x b x + a 2 b 3 +a x b 2 +a 2 b x +a 3 


a x -b x a 2 — (b x c x + b 2 ) a 3 -(b x c 2 + b 2 c x + b 3 ) 


a\ 

a \ a 2 H 

a x c x 


a\ 


a 2 - 


a\ 

a 3 +a x a 2 + -jr- 


a 3 - 


(a 2 c x +2a x c 2 ) 


na 4 + c x a 3 (n —1) + 
±n(n-l)a} + 

Un-\){n-2)c x a x a 2 + 
j 4 (n-\)(n-2)(n-3)c x a 3 x 


b 4 + a x b 3 + a 2 b 2 + 
a 3 b x + a 4 


a 4 — (b x c 2 + b 2 c 2 + 
b 3 c x + b 4 ) 
a 2 

a 4 + a x a 3 + — + 


a 2 a\ 

2 


+ 


24 


(a 3 c x + 2a 2 c 2 + 3a x c 3 ) 

a 4 - 


2 


4 
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Transformation of Series 

Occasionally, slow-converging series are encountered in numerical analysis 
where the object is to compute the sum of the series to high accuracy. 
Usually we would use some form of economization to improve the ac¬ 
curacy of such a series (see Chapter 8). We may, however, also know 
another series that can be used to improve the convergence (accuracy) of 
the original series. This is convenient when numerically evaluating the sum 
of a slowly converging series of the form 


00 

S= 2 a k 
k = 0 

where it is known that the series does in fact converge and where we have 
another series 


00 

<=- 2 Ck 
k= 0 

which is also convergent and which we know to have the sum c and the 
limit of a k /c k as k approaches infinity to equal X (where X is not equal to 
zero); then 


This technique is known as Kummer’s transformation. It transforms one 
series into another that is more convenient for numerical evaluation. While 
not developed originally for this purpose, it turns out to be quite useful in 
numerical evaluation of slowly converging series. 

Another approach to numerically evaluating a truncated series is to use 
the Euler-Maclaurin summation formula. This is another technique for 
numerically evaluating series using another series that converges more 
quickly. Provided that the difference of derivatives at the end points of the 
interval over which the series is being evaluated is small, the Euler- 
Maclaurin summation formula is 

51 = 2 /*= f f{k)dk— i(/ 0 -/„) + i2 {In )— fo l) ) 

k = 1 ■'0 

( f(V)_ wv>\ 


30240 



96 


Elementary Analysis with the Pocket Calculator 


3-5 THE SOLUTION OF POLYNOMIALS 

The numerical solution of a polynomial on the pocket calculator involves a 
clear understanding of the possible location of the polynomial’s roots in 
the complex plane. For this reason, we take a few moments to refresh our 
understanding of algebraic equations. It should be remembered that an 
/jth-order algebraic equation has n roots. If the coefficients in the poly¬ 
nomial are real, the roots of the equation are either all real, some being 
equal and some not, or have pairs of roots that are complex conjugates of 
each other and other roots that are real with various locations on the real 
axis. The occurrence of complex roots in complex conjugate pairs arises 
from our assumption that the coefficients in the polynomials are real, not 
complex. If the coefficients are complex, of course, the roots can occur 
anywhere in the complex plane. In this book we concern ourselves only 
with polynomials that have real coefficients, since they are the most 
frequently encountered algebraic equations in engineering analysis. 


The Solution of Quadratic Equations 


If we are given a quadratic equation of the form 

az 2 + bz + c = 0 

its roots can be numerically evaluated with the formula 


2 i = 



y i 

2a 


where 


z 2 — 



y? 

2 a 


q = b 2 -4ac 


From time to time we will make use of the following easily verified 
properties of the roots: 


c 

z x z 2 = - 
1 2 a 

It is apparent from the equations for the two roots that 

1. If <7>0, the two roots will be real and unequal. 

2. If q = 0, the two roots are both real and equal. 
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3. If *7 <0, the roots occur in complex conjugate pairs. 

The numerical evaluation on the pocket calculator should involve first 
the calculation to determine q and then the use of equations for the roots 
for their evaluation once the situation of the roots is determined. 


Solution of Cubic Equations 


If we are given a cubic equation of the form 

z 3 + a 2 z 2 + a x z + a 0 = 0 


the first step in computing its roots is to calculate q and r: 

a i a l 


Then: 


a l a 2 ~^ a 0 a\ 
6 27 


1. If < 7 3 + r 2 >0, the cubic equation has one real root and a pair of 
complex conjugate roots. 

2. If q 3 + r 2 = 0, all the roots are real and at least two are equal. 

3. If q 3 + r 2 < 0, all roots are real and unequal (the irreducible case). 

Once the nature of the roots is known, it is a simple matter to use the 
following equations to evaluate the roots on the pocket calculator. First, 
compute 

, 1 =[ f+( ^ + ^) 1/2 ] ,/2 

W) l/2 ]' /2 

Then the roots can be calculated from an understanding of their nature 
and the following three equations: 

*i = Ui + J2)-y 


-(■ s , + 5 2 ) a 2 jV3 

z i= -1-T + "^r- ( - v i 


(s, + ^ 2 ) a 2 jV 3 , 

—-y + — 


2 
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Note that if q 3 + r 2 — 0, s x will equal s 2 and the imaginary component of the 
roots will drop out, leaving the two z roots, z 2 and z 3 , equal, while z x may 
not necessarily be equal, depending on the value of s 2 . 

Once the roots of the cubic equation are evaluated, they satisfy the 
following relations: 


Zi + Z 2 +Z 3 =-tf 2 

Z { Z 2 + Z X Z 3 + Z 2 Z 3 = a \ 

Zl z 2 z 3 =-a 0 

These relations can be used as a check on the calculation of the roots. 

The process of numerically evaluating the roots of the quartic equation 
is somewhat involved, even for pocket calculator evaluation. Under some 
conditions, however, simple evaluations can be made. For example, con¬ 
sider the quartic equation 

z 4 + a 3 z 3 -I- a 2 z 2 + a x z + a 0 = 0 

One approach to evaluating the roots of this quartic equation is to find the 
real root of the cubic equation 

(x 3 ~a 2l x 2 + ( a x a 3 - 4a 0 )p - ( a\ + a 0 a 2 - 4 a 0 a 2 ) = 0 

and then determine the four roots^of the quartic equation as solutions to 
the two quadratic equations 



u 2 + 




1/21 




1/2 


= 0 


Once the roots of the quartic are evaluated and can be written in the form 
z 4 + a 3 z 3 + a 2 z 2 + a x z + a 0 =(z 2 +p x z + q x )(z 2 +p 2 z + q 2 ) 
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the following conditions hold: 


Pi+p 2 = a 3 


P\P 2 + qx + q 2 = a 2 


Pi<h+Piq \ = a \ 


q\<h = a o 

Finally, if z ]} z 2 , z 3 , z 4 are the roots of the quartic equation, the following 
conditions hold among the roots: 

z j + z 2 + z 3 + z 4 = — a 3 

2 

z l z 2 z 3 z 4 = a 0 

Again, these conditions can be used to check on the calculation of the 
roots. 

The evaluation of the roots of a polynomial up to quartics is tedious and 
usually inaccurate (at best) on a slide rule, by hand analysis, or even on the 
old mechanical calculators (though accurate); it is a relatively fast and 
accurate process on the pocket calculator, however. 

3-6 SUCCESSIVE APPROXIMATION METHODS 

Again, we are concerned with the problem of determining the roots of an 
equation, but the equation is of a more general form. We are looking for 
the condition 


/(*) = 0 

That is, we are looking for the values of x such that f(x ) will equal zero. In 
this case f(x ) need not be a polynomial in *. If we let x = x n , the 
approximation of the root, then when f n is not equal to 0 it is equal to e 
(the error). If we now use e to update our estimate of the root, 


^ x = c n £n = eJn 
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we can write 

x„ + i = x n + cj(xj, (n = 1,2,3 — ) (3-1) 


When it is found that f'(x) is greater than or equal to zero and the 
constants c n are negative and bounded, the sequence of x n converges 
monotonically to the root x = r. If c is a constant less than zero and /' is 
greater than zero, the process converges but not necessarily monotonically. 
A number of approaches have been developed to compute c n . Among these 
are the regula falsi method, the method of successive iterations, Newton’s 
method, and the Newton-Raphson method. The regula falsi method begins 
with the assumption that we are given y — f(x); the objective is to find x = r 
such that f(r) = 0. We choose a pair of values of x, x 0 , and x Y such that 
f(x 0 ) and /(-xq) have opposite signs. Then equation 3-1 can take the form 


x i -M, _Ji x o jo- 

r f-/o 


(3-2) 


The third- and higher-order estimates of the root x n are computed using x 2 
and either x 0 or x x for which f(x 0 ) or f(x { ) is of opposite sign to f(x 2 ). 

This method is equivalent to an inverse interpolation. This is apparent 
from the form of equation 3-2. 

In the method of successive iterations, the approach is to write the 
equation in an implicit form and use successive iterations to solve the 
equation x= F(x). The iteration scheme is to compute 

X n + \=f( X n) 


The sequence of solutions to this implicit equation will converge to a zero 
of x = F(x) if there exists a q such that 


and 


|/'(*)| < q < 1 for a < jc < b 


a 


x n ± 


f( x o) ~ *ol 

1 ~q 


<b 


This is an attractive method for use on the pocket calculator because it 
does not involve remembering special formulas such as those associated 
with the regula falsi or the Newton (Newton-Raphson) methods. The 
problem encountered in applying the method of successive iterations on 
the implicit form of the equation whose roots are to be determined is that 
the implicit equation may not converge as quickly as other methods based 
on additional information (such as the derivatives of f(x)) whose function 
it is to ensure rapid convergence of the method. 
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Newton’s method is to compute recursively estimates of the roots of the 
function /(x) using the formula 


/w 


(3-3) 


where x = x n is an approximation to the solution, x + r , of /(x) = 0. The 
sequence of solutions generated with Newton’s rule will converge quadrati- 
cally to x = r. The condition for monotonic convergence is that the product 
f(x 0 )f"(x 0 ) is greater than zero, and f(x) and f"(x) do not change sign in 
the interval (x 0 , r). The conditions for oscillatory convergence are also 
straightforward. When the product x(x 0 )/"(x 0 ) is less than zero and f'(x) 
and f"(x) do not change sign in the interval (x 0 ,x,), equation 3-3 will 
converge, though it will oscillate. These conditions only hold, of course, 
when 

x 0 < r < Xj 


When Newton’s method is applied to the evaluation of th roots, we find 
that given x n — N, if x k is an approximation of x= N l/n then a sequence of 
improved x k can be generated: 


i = - I x t 

n 


N 


+ n- 1 


This method will converge quadratically to x for all n, and is particularly 
useful for computing the n\h roots iteratively on the four-function pocket 
calculator as covered in Chapter 1. It is derived here to show the proce¬ 
dure: 


1. We wish to compute x = (N) l/ ' n . 

2. Form f(x) = (x n — N) — 0 from (1). 

3. For Newton’s rule, x k + l = x k —f(x k )/f'(x k ), we need f(x k ) and f'(x k ). 
•4. f(x k ) = (x k n - N) and f\x k ) = (x/i*- 1 - 0). 

5. Substituting the results of (4) into (3) we find (6). 


6 . 


X k -b\ ~ x k 


x k ”-N 


nx. 


x£+N 


nx. 


N 


— +(n~l)x k 


l 

n 


Kv "- 1 


Details on finding the zeros of functions—an important subject in numeri¬ 
cal analysis are given in Chapter 9. 
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3-7 ELEMENTARY TRANSCENDENTAL FUNCTIONS 


In Chapter 1 we presented polynomial approximations for most of the 
transcendental functions found on the keyboard of the scientific calculator 
so that they could be evaluated on the simple four-function calculator. Not 
presented there, however, were approximations in terms of Chebyshev 
polynomials. Because the approximation in terms of Chebyshev poly¬ 
nomials is a mini-max approximation (minimizes the maximum error on 
the interval - 1 to +1), they are accurate and useful, and for the sake of 
completeness they are presented here. Th,e discussion of the reduction of 
order of series approximations to function in terms of Chebyshev 
polynomials is covered in Chapter 8 and their numerical evaluation is 
covered more fully in Chapter 4. For now we concern ourselves with the 
numerical evaluation of the elementary transcendental functions using 
Chebyshev polynomials. 

Evaluating the natural log of y for y near zero can be difficult, at best. If 
y is near zero it is convenient to write 

H y) 

in the form 


ln( \+x),y = \ + x 
Then we can write ln(l + x) as 


ln(l + jc) = 


2 a„t„( x ), 


n = 0 


(0< X < 1) 


where the coefficients A n are 

_ A n 

0 0.376452813 

1 0.343145750 

2 -0.029437252 

3 0.003367089 

4 -0.000433276 

5 0.000059471 

6 -0.000008503 

7 0.000001250 

8 -0.000000188 

9 0.000000029 

10 - 0.000000004 

11 0.000000001 
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and where 

T n (x) = 2xT n _ l (x) - T (n _ 2) (x) 

T 0 

= 1 

T, 

= X 

In a similar way, we can use 

Chebyshev polynomials to evaluate both 

e x and e~ x 

00 

The coefficients for evaluating 

*■'= S \T n (x) are 

n—0 

n 

a. 

0 

0.64535270 

1 

— 0.312841606 

2 

0.038704116 

3 

-0.003208683 

4 

0.000199919 

5 

-0.000009975 

6 

0.000000415 

7 

-0.000000015 

The coefficients for evaluating 

e x are 

n 

A n 

0 

1.753387654 

1 

0.850391654 

2 

0.105208694 

3 

0.008722105 

4 

0.000543437 

5 

0.000027115 

6 

0.000001128 

7 

0.00000040 

8 

0.000000001 

Again the restriction x is that 

0< x< 1 
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Now that we have a few example functions to work with, consider the 
procedure for using the Chebyshev polynomials to numerically evaluate 
these functions: 

Step l Let the objective be to evaluate e x near x = x 0 j where x 0 is not on 
the interval 0< x< 1. Rewrite e x so that the exponent is on the 
interval [0,1]. For the sake of this discussion, we use 

e x = e (x~ x 0 ) + x 0 _ e x 0 [ e (x- x 0 )j __ e x 0e y 

Then for x on the interval 

x 0 < x < x 0 + 1 

y is on the interval 

0< y < 1 

Step 2 Select x and compute y. 

Step 3 Compute T x =y. 

Step 4 Compute T 2 = 2yl\-\. 

Compute Tj = 2 y T 2 - 7j. 


Compute T„ = 2yT n _ l *~-T„_ 2 . 

n 

Step 5 Compute e y = using the appropriate A n . 

o 

Step 6 Compute e x — e x °e y . 

Usually x 0 is chosen to be a convenient number for precise evaluation of 
e x ° using the prime factors method (presented in Chapter 1). For example, 
if x = 100, then e 100 becomes 

which is easily evaluated with a table lookup of e and 13 (at most) data 
entry key strokes plus 16 multiply key strokes on the four-function cal¬ 
culator. 
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The Chebyshev approximations for sine and cosine are given by the 
relation 

iA„T m (x*) 

X z ’ rt =0 

and 

OO 

^f) = 2 A „T„(x 2 ) 

using the coefficients for A n : 



Sine 


Cosine 

n 


n 


0 

1.276278962 

0 

0.472001216 

1 

-0.285261569 

1 

-0.499403258 

2 

0.009118016 

2 

0.027992080 

3 

-0.000136587 

3 

-0.000596695 

4 

0.000001185 

4 

0.000006704 

5 

-0.000000007 

5 

-0.000000047 


Here x must reside in the interval 


x| < 1 


Formulas for the Solution of Plane and Spherical Triangles 

Many elementary analysis problems involve the solution of triangles. These 
include plane right triangles, plane triangles, and spherical triangles. Con¬ 
sider the plane right triangle shown in Figure 3-1. Here A, B, and C are the 
vertices of the triangle and a , b and c are their opposite sides. Then 

• ^ a 1 

sinT = — =-- 

c esc A 

A b 1 

c sec A 
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Figure 3-1 Ri ght triangle. 


Now consider Figure 3-2. This plane triangle has angles A, B, C and 
sides opposite a,b,c. The law of sines states that 

_a b _ c 

sin ,4 sin B sinC 

and the law of cosines is 


cos^ = 


' + b 2 


2 be 


Also, the following four relationships hold for plane triangles: 
a ~ b cos C + c cos B 

a+_b tanj(/t + fl) 

a — b tan i(/l — B) 


area = 


bcsmA 

2 


area = [ 5(5 - a)(5' — b)(s~ c)] 2 

where s = %(a + b + c). 

Figure 3-3 shows a spherical triangle with angles A, B, C and sides 
opposite a, b, c. The four commonly used formulas in spherical tri- 
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B 



Figure 3-2 Plane triangle. 


gonometry are 

sinyj _ sinB _ sinC 
sin a sin b sine- 

cos a = cos 6 cos c 4- sin b sin c cos A 

cosbcos(c ± 9) 

cos a— -—- where tan 9 = tan bcosA 

cos (9) 

cos A = — cos B cos C + sin B sin Ceos a 

In solving spherical triangle problems we can use either the scientific 
keyboard function evaluation or, on the four-function calculators, the 

B 



Figure 3-3 Spherical triangle. 
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polynomial or Chebyshev approximation to the transcendental functions 
involved. These developments for the right triangles are shown here not so 
much because of their unique form for pocket calculating, but because 
they are very frequently encountered in almost all forms of engineering 
analysis and, again, are provided here for the sake of completeness. 


3-8 COMPLEX VARIABLES AND FUNCTIONS* 

In the remainder of the book, and in Chapter 4 in particular, the equations 
and formulas used for analysis on the pocket calculator hold both for real 
and complex variables. In this section, we touch briefly on analysis with 
complex variables. 

Complex variable analysis on the pocket calculator results in nothing 
more than keeping track of the real and imaginary coordinates either in 
polar or in Cartesian form. Pocket calculators with conversion from 
rectangular to polar make the analysis with complex variables particularly 
easy. Since virtually all advanced scientific calculators have this feature, 
we assume here that it is present. The formulas given here for analysis with 
complex variables and for the evaluation of functions of complex variables 
can be quickly and easily developed on the four-function calculator using 
the trigonometric functions developed earlier in this chapter or in Chapter 1. 

The addition and subtraction of two complex variables are simply 
defined by 

(*1 +/>l) + (* 2 + *> 2 ) = (*l + * 2 ) + K.V 1+V 2 ) 

(*1 + iy\) ~ (*2 + *> 2 ) = (*1 - * 2 ) + i(y\ ~y 2 ) 

Multiplication is more conveniently done in polar coordinates; that is, 
(■*1 + *>i)(* 2 + *> 2 ) = r x r 2 e ii0 ' + 9l) 


where 

X, +iy, = 

x 2 +iy 2 = r 2 e^ 


Here r is the positive root sum square of the imaginary and real com¬ 
ponents of the complex number v = (x 2 +y 2 Y /2 and 

y 

9 = inverse tan — 
x 


*See Appendix 3 and Appendix 4 
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The division of two complex numbers is given by 

X| + ^' (x+iy^0) 

x 2 + iy 2 r 2 


Or, in rectangular form, 

X 2+Vi x l+yj xj+yj 


where the denominator is developed by multiplying the numerator and 
denominator by the complex conjugate of the denominator. 

We will frequently encounter certain commonly used functions of com¬ 
plex variables (complex functions). The most often occurring one is the 
modulus (absolute value) of a complex number, which is defined by 


|* + />| = (x 2 + y 2 ) 1/2 

Another commonly encountered complex function is that of the square 
of a complex number, which is simply given by 


(x + iyf = (x 2 -y 2 ) + i{2xy) 


Not so easily remembered but occasionally encountered is the square root 
of a complex number, which is given by 


Vx + iy — ± 


1 + (x 2 +y 2 ) 


1/2 




x + x(x 2 +y 2 ) 


1/2 


- 1/2 


Clearly, the powers and roots of a complex function can be more easily 
evaluated in polar coordinates. Thus, in general, 


and 


(* + iy) n = r n e in9 = r n (cosn0 + i sin nO) 

( . . \ l/n ]/„ / 6 + 277k \ . ( 0 -\-2ti k\ 

(x + iy) =r l/n cosf-j + /sm!-1 


These formulas are written for angles in degrees, not radians, and for 
angles where n is an integer greater than zero and k takes on any integer 
values from zero n— 1. The only restriction on these two complex 
functions is that the complex variable cannot equal zero. 
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Exponential and logarithmic functions of complex variables are also 
easily developed when the complex variable is written in polar form: 

x + iy — re ,e 

Then it is straightforward to develop 

e {x + iy) = e x (cosy + isiny) 

Similarly, the natural log of a complex variable is given by 

ln(x + iy) = lnr+ i9±2mk, (k = 0,1,2,* * •) (3-4) 

These relationships can be generalized to any base according to the 
relation 


a (x + iy ) __ g{x + iy)lna 


log a (x+i» = 


ln(x + iy) 
In a 


Even more generally, the complex powers, complex roots, and complex 
logarithms of a complex variable can be developed, again using polar 
coordinates. If we are given 


z = x + iy and w=v+iv 


then the complex powers and roots of a complex number are given by 


Finally, 


z w _ e w\nz 

(3-5) 

Z l/w = e (\nz)(l/w) 

(3-6) 

, . M*) 
log.M- , . 

ln(z) 

(3-7) 


In equations 3-5,3-6, and 3-7 we can use equation 3-4 for taking the 
natural log of a complex number. 

Complex trigonometric functions are often presented in complex vari¬ 
able theory books more from the standpoint of derivation and develop¬ 
ment than from the standpoint of numerical evaluation. Hence the numeri¬ 
cal evaluation formulas sometimes get buried in the derivation. Here we 
present the formulas for complex trigonometric functions in a form that is 
easily evaluated on the pocket calculator. No attempt is made to derive 
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these formulas, interesting though they are, because again the emphasis 
here is on numerical evaluation of the functions. 

The most straightforward expressions are the sine, cosine, and tangent 
functions of the complex variable 

z = x + iy 

Then 

sinz = sin x coshy + / cosx sinhy 

cos z = cos x coshy - u sin x sinhy 

sin2x + /sinh2y 

tan z =---—— 

cos2x + cosh2y 

Less straightforward are the complex inverse trigonometric functions. 
Again, when 


z = x + iy 

the inverse sine of z is given by the relation 

sin _1 z = fciT + (— 1 )* sin “ */? + (— l)*/sgn(y)ln[a + (a 2 — 1) 1/2 ] 

In this formula 

« = jV(* + OV + iV(*- 1 ) 2 +>’ 2 (3-8) 


P = {^(x+xf+y 2 - i]f(x- y+y 2 (3-9) 

and the function sgn(y) is given by the relation 

sgn(y)=( 1 lf y> ° (3-10) 

l-l if y<0 

Finally, k in this formula is an integer. A convenient simplification for 
pocket calculator analysis is to take into account the fact that the inverse 
trigonometric functions are multiple valued and thus the /c = 0 case (the 
easiest to evaluate numerically) can be used to evaluate the inverse sine of 
z when care is taken to account for the “quadrant” in which z is being 
determined. Then sin _, z simplifies to the form 

sin ~ 1 z = sin “ 1 /S + / sgn(y) In [ a + ( a 2 - 1) 1 ^ /2 ] 
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Similarly the inverse cosine can be numerically evaluated from the 
equation 

cos -1 z = cos -1 /? — /sgn(y)ln[a + (a! 2 - l) 1/2 j (3-11) 

where a and /? are given by equations 3-8 and 3-9 and sgn(y) is given by 
equation 3-10. Here, as before, k is assumed to be zero. However, were k 
not equal to zero the more general form of equation 3-11 is given by 

cos -1 z=2/c7r ± |cos ~ l ft- wsgn(y)ln[ct + (a 2 - 1) 1/2 ]} 


Finally, the inverse tangent in its most general form is given by 

1-y 


tan 1 z = j 


+ > 


(2k + 1)77 — tan ——• j 

(1 +yf + x 2 


— tan” 


(\-yy+x 2 

which when k = 0 simplifies to the form 


tan 1 z = — \ 


tan" 


+ tan" 


mW 




(i + y y+x 2 
(1 -yf + x 2 


With these relationships; it is a simple matter to define the complex 
hyperbolic and complex inverse hyperbolic functions in terms of the 
trigonometric functions and their inverses: 

sinhz= — / sin iz 

cosh z — cos iz 

sinh 2x -1- i sin 2 y 
tanh z cosh 2 jc -l- cos 2y 

Similarly, the inverse hyperbolic functions are defined as 

sinh _1 z= — i sin -1 iz 
cosh - 'z = i cos -l z 
tanh -1 z = — i tan -1 iz 
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Other complex trigonometric relationships useful in evaluation of complex 
functions are 

cscz = (sinz) 1 

secz = (cosz) 1 

sin 2.x — i sinh2>’ 

coton z =-—-r— 

cosh 2y — cos 2x 

csc -1 z = sin -1 (z -1 ) 

sec -1 z = cos -1 (z _l ) 

coton -1 z = ^ — tan -1 (z) 

csch z = i esc iz 

sechz = sec/z 

sinh2x— /sin 2^ 

coth z =- — -r— 

cosh 2 x — cos 2 y 

csch"" 1 z = /esc -1 iz 

sech -1 z = /sec -1 z 

coth _1 z = / cot -1 iz 
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CHAPTER 4 


NUMERICAL EVALUATION 
OF ADVANCED FUNCTIONS 


4.1 INTRODUCTION 

Even the simplest pocket calculator can evaluate advanced mathematical 
functions to accuracies required in engineering use and certainly carrying 
as many significant digits as do the typical tables in mathematical 
handbooks. In part, then, this chapter deals with freeing the analyst from 
having to carry or have access to extensive tables to numerically evaluate 
the advanced mathematical function. Among the advanced functions con¬ 
sidered in this chapter are the exponential integral, the gamma function, 
the error function and Fresnal integrals, Legendre’s polynomials, Bessel 
functions of integer and fractional orders. Confluent hypergeometric func¬ 
tions, Chebyshev polynomials, hypergeometric functions, Hermite 
polynomials, and Laguerre polynomials. Again, we stress not so much the 
analysis with these functions and their analytical properties as their 
numerical evaluation on the pocket calculator. 

There are three methods for numerically evaluating advanced mathema¬ 
tical functions: 

1. The function is approximated by a polynomial approximation or 
curve fit that permits accurate evaluation of the function directly through 
analytic substitution. 

2. If the function is one of a sequence of generated polynomials, the 
low-order polynomials can be determined for the argument of the function 
and the higher-order polynomials then numerically evaluated by means of 
the recursion formulas. 

3. Successive partial sums of the series that describes the advanced 
function are computed. 
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The first method has the greatest body of mathematical literature. It also is 
the simplest to apply in numerically evaluating advanced functions in that 
it involves the simple procedure of evaluating a polynomial. The third 
alternative is the least attractive, since rapidly converging series are often 
difficult to develop over all intervals of interest. An example of this is the 
Bessel function where a number of series can be written that converge 
quickly in certain intervals, but there is no one series that converges 
quickly over the entire range of the independent variable in the Bessel 
function. In fact, the Bessel function requires the same consideration in 
developing polynomial approximations. To get precision, polynomial 
approximation with a reasonable number of terms requires more than one 
polynomial approximation to span the interval of the independent variable 
from minus infinity to plus infinity. 

Finally, the second approach (the use of low-order polynomials to 
determine the argument of the advanced function and then employing 
recursion formulas to numerically evaluate higher-order polynomials) is 
used extensively in the generation of accurate mathematical tables and 
thus is handy for pocket calculator analysis, though somewhat tedious at 
times. 

In this chapter, all three approaches are used, each where appropriate 
for evaluating the advanced mathematical functions covered here. Care 
has been taken to select, from the number of available numerical methods 
for evaluating these functions, those methods that can be implemented 
with a minimum amount of work on pocket calculators, and particularly 
the four-function variety. 

Where tradeoffs are difficult the method that leads to the quickest 
evaluation has been selected. A specific example would be the Chebyshev 
polynomials which are evaluated here using recursion formulas, rather 
than sine and cosine functions, which are available on most scientific 
calculators. The reason for this and a similar situation in the half-integer 
Bessel functions is that the approach presented here can be numerically 
calculated on the four-function calculator which does not have the sine 
and cosine functions. 

Those who have done the evaluated certain mathematical functions on 
large-scale digital computers should recognize that the methods chosen 
here are not necessarily the same as those commonly used on large digital 
computers. Partly the numerical methods are chosen for the pocket cal¬ 
culator and partly because the methods are to be instructive to students 
and working engineers who may have been away from the application of 
these functions. For these cases, familiar numerical methods are often 
chosen, though they require slightly more work than the numerical 
methods used for large computer numerical evaluation. It is important to 
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remember that in pocket calculator analysis a mathematical function is 
usually evaluated only a few times, while on a large digital computing 
machine it might be evaluated a great many times. For large machines, the 
emphasis is on maximizing the accuracy with the minimum number of 
steps involved in the subroutine. In pocket calculator analysis, the empha¬ 
sis is more on understanding the method and providing accuracy consis¬ 
tent with the display in the pocket calculator, on a one-time basis. Thus the 
requirements for a numerical method for pocket calculator evaluation are 
significantly different than those for large computer evaluations. 


4.2 EXPONENTIAL, SINE, AND COSINE INTEGRALS 

Four commonly encountered integrals are two forms of the exponential 
integrals and the sine and cosine integrals. The exponential integrals that 
we discuss here are of the form 

AW = jf ^Y dt > «argz|<„) 

/•“ p-< r x p' 

Ei(x) = - I - dt — I — dt, (x>0) 

J-X t Too ^ 

More generally we are also interested in the exponential integral 



C 00 e ~ 2t 

( V 

(n = 0, 1,2, ...,Re(z)>0) 

methods for numerically evaluating both 

r oo 


+ 

N 

(n = 0,1,2,...,Re(z)>0) 

AW- J 

1 t n e~ 2t dt, 

- 1 

(n = 0,1,2,...) 


Though each of these integrals is defined for complex arguments, our 
interest here is primarily with their evaluation for real arguments. How¬ 
ever, it is usually assumed that the path of complex integration does not 
include the origin, nor does it cross the negative real axis. 

The approach to numerically evaluating the exponential integrals is first 
to have a means to numerically evaluate them for n— 1, and then use 
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recursion formulas for numerically evaluating the higher-order exponential 
integrals. There are two approaches to evaluating the integrals: the use of 
either an infinite series or rational polynomial approximations (see Chapter 
8). The series expansions for these functions are 


Ei(x) = Y + lnx+ , (x>0) 

n = 1 


E\( z )= - Y-lnz- 2 


(“ 1 )*" 


(I argzj < tt) 


n = 1 


E n( z ) = 7 -^-, 7 ,- [ - In z + xp(n)]~ 2 


(-*)' 


(n~ 1)! 


(m — n+ \ )m\ 

m^n — 1 


(I argz| < w) 


a n {z) = n\z ( " +n e z |2frj 




Here 


4/(1)= — y 


n — 1 


4/(At)=-Y+ 2 T7 


m = 1 


and 


y = 0.5772156649 (Euler’s constant) 

These functions can be evaluated more conveniently, using rational 
polynomial approximations. 

For the dependent variable x on the interval zero to 1, the exponential 
integral can be evaluated with the polynomial 

£,(x) + lnx = a 0 + x(a x + x(a 2 + x(a 3 + x(a A + a 5 x)))) + e(x) 
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with an accuracy of two parts in 10 7 , using the coefficients 

a 0 = - 0.57721566 a 3 = 0.05519968 

a x = 0.99999193 a 4 = -0.00976004 

a 2 = -0.24991055 ^ = 0.00107857 

when x is on the interval 



|€ (at) | < 5 X 10 -5 

For even greater precision, the-exponential integral can be evaluated over 
the interval x greater than or equal to 10, using the same rational 
polynomial but with the coefficients 

a x =4.03640 b x = 5.03637 
a 2 = 1.15198 b 2 = 4.19160 


Here the error is given by the relation 

|e(x)| < 10 -7 

For x greater than 1, the rational polynomial approximation 

Jcr , / . a A +x(a^ +x{a 2 +x{a. x +x))) 

xe x EAx) =-- + e(x) 

b 4 + x(b 3 + x(b 3 + x(b x + x))) 
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using the coefficients 


flj- 8.5733287401 
a 2 = 18.0590169730 
a 3 = 8.6347608925 
a 4 = 0.2677737343 


= 9.573322454 
b 2 — 25.6329561486 

6 3 = 21.0996530827 

6 4 = 3.9584969228 


can be used to evaluate the exponential integral to an accuracy of 


e(jc)|<2xl0“ 8 


Once the exponential integral is numerically evaluated, the following 
recursion formula can be used to compute the higher-order exponential 
integrals for the same arguments: 


E n+l (z)=~[e z -zE n (z)l («= 1,2,3,...) 

zoL n (z) = e- z + na n ~ l {z), (n= 1,2,3, ...) 

zPn( z ) = (~ ^) n e z ~e~ z +n(i n _ x (z), (n= 1,2,3,....) 


The sine and cosine integrals are defined as" 

si(z)= 

Jo 1 

, /" z cos(0 —1 

C/(z) = 0 + lnz + I —-— —~dt, (|argz|<77-) 

Furthermore, we make note of the definition 

5/(z) = S/(z)--| 

Then two auxiliary functions can be developed that have the form 
/(z)= C/(z)sinz — j/(z)cosz 
g(z)= - Ci(z) cosz — si(z) sin z 
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Then the sine and cosine integrals can be written in terms of the auxiliary 
function as 

Si (z) = J —j(z) cosz - g(z) sinz 
Ci(z)~ /(z)sinz-g(z)cosz 

where the auxiliary integrals are defined according to the relations 


or 


and 


or 



dt 


dt 


dt 



dt 


subject to the condition for convergence of these integrals 


Re(z) >0 


The reason for doing this is that rational approximations to the auxiliary 
functions are easily developed with high precision. 

For four-place precision, the auxiliary function can be determined using 
the rational approximation 



' a 2 + x 2 (a 1 + x 2 ) 
, b 1 + x 2 {b l + x 1 ) 
e(*)| <2 X 10“ 4 


+*w 


, 1 / a 2 + x 2 (a t + x 1 ) 

^ x 1 1 b 2 + x 2 (b t + x 2 ) 
|e(jc)|< 10- 4 


+«w 


for x greater than 1 using the coefficients 


a, = 7.241163 
a 2 = 2.463936 
=9.068580 
£, = 7.157433 


a Y = 7.547478 
a 2 = 1.564072 
b x — 12.723684 
£,= 15.723606 
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For precision to five parts in 10 7 , the auxiliary functions can be approxi¬ 
mated with the rational approximations 

1 / a 4 +x\a 3 +x 2 (a 2 +x\a x +x 2 ))) \ 

* \ b 4 + x\b 3 + x\b 2 + x\b x + x 2 ))) / 

' U 4 +X 2 («3+X 2 (a 2 +X 2 («j + X 2 ))) \ 

v b 4 + x 2 (Z? 3 + x 2 (b 2 + x 2 (b l + x 2 ))) / 

for 1 < jc. The coefficients for f(x) are given by 

«,= 38.027264 b { = 40.021433 

a 2 = 265.187033 b 2 = 322.624911 

a 3 = 335.677320 b 3 = 570.236280 

a 4 = 38.102495 b 4 = 157.105423 

and for equation g(x) 

42.242855 b { = 48.196927 

a 2 = 302.757865 b 2 = 482.485984 

g 3 — 352.018498 b 3 = 1114.978885 

a 4 = 21.821899 b 4 = 449.690326 

The infinite series for numerically evaluating these functions are * 

00 (—l)”z 2/1+I 

5 ' (Z) = „?o(2«+D(2«+1)! 

OO 

Si (z) = 17 2 J S+i/ 2 (f) 

rz = 0 

and for the cosine integral 



C/(z) = y + lnz + ^ 
«=i 


(-OV* 

2n(2n)! 


*We shall see that fractional Bessel functions can be conveniently evaluated on the scientific 
pocket calculator. 



122 


Numerical Evaluation of Advanced Functions 


4.3 NUMERICAL EVALUATION OF THE GAMMA FUNCTION AND ITS 
RELATED FUNCTIONS 


The gamma function is defined by Euler’s integral 

T(z)= (Rez > 0) 

•'o 

or 


r(z) = k z l e~ kt dt, (Rez > 0, Re A: > 0) 
•'o 


Euler’s formula for evaluating the gamma function is of the form 


T(z) = lim 


n\n‘ 


z(z + l)(z + 2 )- • • (z + n) ’ 


(z t^O, — 1 , — 2 ...) 


He also gave an infinite product expression for evaluating the gamma 
function: 

00 


r( 


II (1 + 

(z) „-il' «' 


where 


7 = lim 


1 + \ \ + 7 + • • • + — - In w 

2 3 4 m 


= 0.5772156649- 


This number is known as Euler’s constant. Only a little analysis is involved 
to show that the gamma function is analytic and single valued over the 
entire complex plane except at the points z = —n (n — 0 , 1 , 2 ,...,) where its 
poles occur. The residues of these poles can be evaluated and are found to 
be 


(-D" 

n\ 

The reciprocal of the gamma function has zeros at the points z = — n 
(n = 0,1,2,...,). The recursion formula for computing the gamma function 
is given by the expression 

r(z + l) = zr(z) 

which is related to the factorial of z according to the relation 


r(z+l) = z! 
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It follows that the gamma function propagates from gamma at (1 + z) to 
gamma at ( n + z) according to the relation 

Y(n + z) = (n — 1 + 2 )! = (n — 1 + z)(n — 2 + z)- • • (l + z)z! 

Another nice property of the gamma function, which is easily evaluated 
on the pocket calculator, is Gauss’ multiplication formula 

T(nz) = (2*)( ■-- >/* n‘ r(z + \ ) 

This formula contains the duplication and triplication formulas given as a 
part of the gamma function characteristics as special cases of this more 
general multiplication formula. 

The gamma function, being related to the factorial of a number, is 
related to the binomial coefficient according to the relationship 

/ z v _ 2 !___ r( z + 1) _ 

\ w / w!(z — w)! r(w+l)r(z-w+l) 

It is apparent that the gamma function’s relationship to the factorial 
makes it convenient to evaluating the gamma function on scientific pocket 
calculators that have the factorial key. 

The gamma function can be evaluated in several ways. One is by a series 
expansion for expansion of 1 /Y according to the relationship 

1 00 

773= 2 <***, (M<°°) 

1 \ z ) k= 1 

where the coefficients to give an accuracy up to 10 places (the nominal 
register size we would expect in current and even some future pocket 
calculators) are tabulated in Table 4-1. The advantage to using this type of 
series expansion technique is that the interval over which the series 
converges is the entire real axis. Polynomial approximations can be used 
over more restricted intervals. Two such approximations are 

Y(x+\) = x\ 

= 1 + x(a Y + x(a 2 + x(a 3 + x(a 4 + a 5 x)))) + c(x) 

Y(x+ 1) = x! 

— 1 + + x(h 2 + x(^ 3 + x(b 4 + x(b 5 + x(b 6 + x(b 7 + ^g-*))))))] 

+«w 
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Table 4-1 Coefficients in the Expansions 1 /T{z) = 'Lf =l c k z k 


k 

Ck 

k 

c k 

1 

1.0000000000 

11 

0.00012 80502 

2 

0.5772156649 

12 

-0.0000201348 

3 

-0.6558780715 

13 

-0.0000012504 

4 

-0.0420026350 

14 

0.0000011330 

5 

0.1665386113 

15 

-0.0000002056 

6 

-0.0421977345 

16 

0.0000000061 

7 

-0.0096219715 

17 

0.0000000050 

8 

0.00721 89432 

18 

-0.0000000011 

9 

-0.0011651675 

19 

0.0000000001 

10 

-0.0002152416 




where the coefficients in the polynomials are 


a { = —0.5748646 
u 2 = 0.9512363 
a 3 = -0.6998588 
a 4 = 0.4245549 
a 5 = -0.1010678 


6, = —0.577191652 
^ 2 = 0.988205891 
b 3 — 0.897056937 

b 4 = 0.918206857 
b 5 = -0.756704078 
6 6 = 0.482199394 
6 7 = -0.193527818 
b B = 0.035868343 


On both of these polynomial approximations the range of the variable * is 
greater than or equal to zero but less than or equal to 1. The former has an 
accuracy of five parts in 10 5 and -the latter polynomial approximation is 
accurate to three parts in 10 7 . Also, because of Stirling’s formula for 
approximating x\, the gamma function can be related to Stirling’s 
approximation according to the equation 


T(az + b)^V2ir e az (az) az + b 1/2 (|argz| < u,a > 0) 

Again, Stirling’s approximation is easy to evaluate on most scientific 
pocket calculators. 
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4-4 THE ERROR FUNCTION AND FRESNEL INTEGRALS 


The error function and its complements are defined as 


and 


erfz = ~r f e * 2 dt 
V77 ■'0 


1 r°° 2 

erfcz =—— e 1 ^ = 1 —erfz 

V77 K 


The error function can be conveniently computed using the series expan¬ 
sion 


erfz = 



2 n 

1.3 •• • (2n + 1) 


z 2n + 1 


In fact, it is common in computing the error function on large computers 
to compute successive partial sums of the series and terminate the evalua¬ 
tion when two consecutive partial sums are equal. The same approach can 
be taken on the pocket calculator, although the calculations are tedious. 
Here again we can use rational approximations to the error function such 
as 

erfz= 1 — [/(a, + t(a 2 + a 3 t))]e~ zl + c(z), (0< z) 

where 


and the coefficients are 


t = 


_ 1 _ 

1 + pz 


= 0.47047 
a x = 0.3480242 
a 2 = -0.0958798 
a 3 = 0.7478556 

This approximation is good to about 2.5 parts in 10 -5 . Accuracy to 
1.5x10“ 5 error can be achieved with a slightly longer series, with the 
addition of two terms as 

erfz = 1 — [/(a t + t(a 2 + t(a 3 + t(a 4 + a 5 t))))e~ z2 ] + c(z) 
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where 


and the coefficients are 


1 +pz 


p = 0.3275911 
a, =0.254829592 
a 2 = -0.284496736 
a 3 — 1.421413741 
a 4 =- 1.453152027 
a 5 = 1.061405429 


The Fresnel integrals are defined by the relationships 



The Fresnel integrals can be computed using the series expansion 


00 

C(z)= ^ 


n= 0 


(2n)!(4n +1) 


OO 

S(z)= 2 


n = 0 


(2n+l)!(4n + 3) 


^4n + 3 


Fortunately, these series tend to converge quickly and can be evaluated 
effectively on the pocket calculator. 

Finally, as might be expected, the Fresnel integrals can be computed in 
terms of sines and cosines directly, but modulated by the auxiliary func- 
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tions /(z) and g(z): 

c (z)= \ +/(*)sin|^j-g(z)cos|^|-j 
«S , W=^-/(^)cos| : y-j-g(z)sin|^-j 



4-5 LEGENDRE FUNCTIONS 

Legendre functions are defined in terms of the hypergeometric functions as 


r?(z) 


1 

r(i-ft) 


z + l 

z-1 



— v,v+\;\ 



where F is defined by the relationship 


F(a,b;c;z 



n = 0 


(c). 



r(c) ^ T(a + n)r(fc + n) / z"\ 
r(a)r (b)£ 0 T(c + n) Ulj 
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This Legendre function is one of two that satisfy the differential equation 


(l-z 2 ) 


d^w 
dz 1 


t d w i 

~ 2z it + 


v(v + 1) — 


1-z 2 


w = 0 


The Legendre function of the second kind is defined by the equation 

V-1V 


Qf(z)-e il " 2 ~ iw+ 'V /2 r ^* , + 


(p + P+1)( 7 2_ i W 2 


T(v + 3/2) 






where F is again the hypergeometric function. These formidable-looking 
expressions are easily evaluated on the pocket calculator using the recur¬ 
rence relationships of varying order and degree: 


(r- fL + I)p; + ,(z) = (2v+ \)zP* (z )- (v+ v)P^_ ,(z) 

Pr'(z) = (z 2 ~\y' / 2 {(,-p)zP!(z)-(v+ n)n-,W} 

P? +l {z)= P?_ x (z) + (lv+\){z 2 -\)' /2 P?-\z) 

The Legendre functions of both the first and second kind satisfy these 
same recurrence relations. The starting values for these recurrence rela¬ 
tions are P 0 (x)= 1 and / > 1 (x) = x and 

&>(*)= CoW = i ln (yrf )’ 


Here n is nonnegative and an integer. 

Another approach to evaluating Legendre functions of integral order is 
to use Rodrigues’ formula to generate the Legendre polynomials and then 
to write them in nested parenthetical form and evaluate them like any 
polynomial. Though plausible, this approach is not developed here because 
the numerical evaluation of the Legendre polynomials is quickly done 
using the recurrence formulas. For the sake of completeness, however, the 
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other approach can be developed using the relationship 


and 


p J z ) 


1 d"(z 2 - 1)" 

2V dz n 


£?,(*) =in(jiy)- ^.-iM 


where 




w„_,(x)= 2 ^P^MP„-Jx) 


m = 1 


and 


W_ x (x) = 0 


The derivative of Legendre polynomials of the first kind can be numeri¬ 
cally evaluated using the recurrence relations 

P„(x)= [xP„(x) -P„ + ,(*)] 

1 - x* 


4-6 BESSEL FUNCTIONS 

Bessel functions are solutions to the differential equation 

2 d dw , / 2 2 \ 

z —7 + z — +(z - ^)w = 0 
dz 2 dz 7 

Of the three kinds of Bessel functions, the first is: 




f? Q k\T(r + k+\) 


The second is written as 
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and the third as 


Hi", H®(z) 

Bessel functions of the first type can also be expressed as hypergeometric 
functions and as an integral: 




(*/2) r 
r(r+i)° 1 


v+\\ 


3 2 _ 

4 




cos 0) sin 2 *' 


Here 0 F l is the generalized hypergeometric function. Bessel’s functions 
of the second and third types are written in terms of Bessel function of the 
first type to simplify their numerical evaluation according to the re¬ 
lationships 


-;- 

sm(*"77) 

H<'>(z)=J„(z) + ;y„(z) 

H1 2, (z)=J,(z)-iY,(z) 

The numerical evaluation of these Bessel functions is somewhat involved if 
done analytically. All can be computed, however, using the recurrence 
relation 


j ,+i= 

where the numerical values for the Bessel functions that go into this 
recurrence formula are given by polynomial approximations. Unfor¬ 
tunately, the startup behavior of the Bessel function for x from — 3 to +3 
is significantly different from the Bessel function evaluated for x greater 
than 3. Two levels of polynomial approximation are therefore involved. 

The polynomial approximations for the Bessel functions of the first and 
second kinds that can be used in combination with recurrence formulas for 
v equal to zero and 1 are shown in Tables 4-2 and 4-3. 
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Table 4-2 Polynomial Approximation of J 0 (*) 

%On the interval — 3 < * < +5 

J 0 (*)=l-2.2499997(|) +1.2656208(|) -0.3163866(|) 

.8 1 ° .12 
+ 0.0444479(y) - 0.0039444( |) + 0.0002100(j) + c 

Here |e| < 5 X 10~ 8 
%On the interval 3 < x 

J o (x) = x~ 1/2 f o cos0 o 

where 

f 0 = 0.79788456 - 0.00000077(3/*) - 0.00552740(3/*) 2 - 0.00009512(3/*) 3 

+ 0.00137237(3/*) 4 — 0.00072805(3/*) 5 + 0.00014476(3/*) 6 + e 
Here |c| < 1.6 X 10 8 
and 

9 0 = x — 0.78539816 - 0.04166397(3/*) - 0.00003954(3 / xf + 0.00262573(3 /xf 

— 0.00054125(3/*) 4 — 0.00029333(3/*) 5 + 0.00013558(3/*) 6 + c 
Here |c| <7X 10“ 8 


Spherical Bessel functions, often called Bessel functions of fractional 
order, satisfy the modified Bessel differential equation 

z\>" + 2 zw' + \z 2 — n(n + l)]w = 0, (/? = 0, ± 1, ± 2, • • • ) 

Spherical Bessel functions of the first kind are of the form 
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Table 4-3 Polynomial Approximation of Jj(x) 

• In the interval — 3 < x < +3 

* ' 1 Ji( x) = i - 0.56249985(x/3) 2 + 0.21093573(x/3) 4 - 0.03954289(x/3) 6 

+ 0.004433 19(x/3) 8 — 0.0003 1761 (x/3) 10 4- 0.00001109(x/3) 12 + e 
Here |c| < 1.3 X 10~ 8 

%In the interval 3 < x 

J,(x) = X l/ ' 2 f l COS0 y 

where 

/, = 0.79788456 + 0.00000156(3/x) 4- 0.01659667(3 /xf 4- 0.00017105(3/x) 3 

— 0.00249511(3/x) 4 4-0.00113653(3/x) 5 —0.00020033(3/xf + e 
Here |e| <4 X 10“ 8 
and 

9, = x - 2.35619449 + 0.12499612(3 / x) 4- 0.00005650(3 / xf - 0.00637879(3 /xf 

+ 0.00074348 (3/ x) 4 + 0.00079824(3 / xf - 0.00029166(3 / x ) 6 4- e 
Here |e|<9xl0“ 8 


and those of the second kind take the form 

Y „+1/2(z) 

The spherical Bessel functions of the third kind are given by 

h nX z )=jn( Z ) + iy„( z ) = f/f H i + l/2(z) 

k?\z)=j«U)-iy,.(z) = f^f 1 / 2 ( 2 ) 
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They can be numerically evaluated using the series 

z" [ * 2 /2 , (3 2 /2) 2 | 

J ’ (Z) 1.3.5-•• (2n +1) [ I!(2« + 3) 2!(2n + 3)(2n + 5) j 

1.3.5- • - (In — 1) | 3 2 /2 | (3 2 /2) J _1 

y " (Z z" +l [ 1! (1 -In) 2! (1 — 2n)(3 — 2«) "j 

where n = 0,1,2, • • • .Or they can be written out and numerically evaluated 
from the expansion of the spherical Bessel functions, using Rayleigh’s 
generating formulas 



A simplification for numerically evaluating spherical Bessel functions of 
high order is to evaluate them at low order, say zero and 1, and compute 
the high-order spherical Bessel functions for the same argument, using the 
recurrence relations 


. _ (2/i + l). . 

Jn+\~ Jn Jn — 1 

This recursion formula applies to all four spherical Bessel functions. 

An even simpler approach for calculators with sine and cosine functions 
using recurrence formulas is to evaluate j n {z) as 

Jn ( z) =/„ Wsinz + (- 1)" + */_(„+ dO) cosz 


where f n is generated with 


using the starting values 


( 2 / 1 + 1 ) 


/ 0 = z ' and /| = z 
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4-7 THE CONFLUENT HYPERGEOMETRIC FUNCTION 

The confluent hypergeometric function, usually written in the form 


where 


and 


tfz , ia) 2 z 7 


M (a,b,z)= 1 + — + 

b (b) 2 z\ 


+ 


+ 


(' 

( b) n n\ 


(b) n = b(b+\)(b + 2)(b + 3)---(b + n-\) 
(a) n = a(a -h l)(a + 2)(a + 3)- • • (a + n- 1) 

(tf)o=l=(Z >) 0 


is the solution to Kummers’ differential equation 


z 


d 2 w 
dz 2 


+ (b- 



aw = 0 


The confluent hypergeometric function is evaluated on the pocket calcula¬ 
tor directly as written by computing successive partial sums of this series. 
Similarly, the hypergeometric functions defined by the relation 


F(a,b;c;z)= ^ 


n = 0 




n ! 


must be evaluated directly by computing successive partial sums of the 
series. In general, when the terms in the series do not change, enough terms 
have been taken for the numerical evaluation to be complete. It is apparent 
that the series is not defined if c= ~m{m — 0,1,2, • • •), except when 


b = — n(n = 0,1,2, • • •) 


where n must be less than m. It is worth mentioning here that the 
hypergeometric function can be used to initialize certain recursion formu¬ 
las for other advanced functions in that the hypergeometric function is 
related to many of the orthogonal polynomials. 


4-8 CHEBYSHEV, HERMITE, AND LAGUERRE POLYNOMIALS 

Chebyshev polynomials can be easily evaluated numerically using the 
recurrence equations 

T„ + l (x) = 2xT n (x)-T„_ i (x) 

where the starting values for the Chebyshev polynomials are T 0 (x) — 1 and 
T l (x) = x. 
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Hermite polynomials and Laguerre polynomials can also be evaluated 
using recurrence equations and initial values of the polynomials. For 
example, the Hermite polynomial can be numerically evaluated using the 
recurrence formula 


H„ +l (x) = 2xH„(x)-2nH„_ l {x) 

where the starting values are computed from the first two Hermite 
polynomials H 0 — 1 and H { =2x. Again, n is a nonnegative integer. 

The numerical evaluation of the Laguerre polynomial is found using the 
recurrence equation 

, , , l(^n+\-x)L n (x)-nL n _ x (x)] 


Here the starting Laguerre polynomials are L 0 = 1 and L x — \—x. As 
before, n is a nonnegative integer. 

It is worth repeating that to numerically evaluate advanced mathemati¬ 
cal functions one of three approaches is usually employed: 

1. The function is approximated by a polynomial approximation or 
curve fits that permit accurate evaluation of the function directly through 
analytic substitution. 

2. If the function is one of a sequence of generated polynomials, the 
low-order polynomials can be determined for the argument of the function, 
and then recursion formulas used to numerically evaluate the higher-order 
polynomials. 

3. The third alternative is simply to compute successive partial sums of 
the series that describes the advanced function. 

Of the three, (1) is the most straightforward approach to evaluating the 
advanced functions. The least attractive approach is that presented in 
(3)—direct evaluation of the series approximation to the function. And, 
finally, a reasonable tradeoff between analytic substitution as described in 
(1) and direct series evaluation presented in (3) is the use of recurrence 
formulas to numerically evaluate high-order functions where the function 
is one of a set of sequences of functions developed with a generating 
formula. 
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ADVANCED ANALYSIS 
ON THE POCKET CALCULATOR 




CHAPTER 5 


FOURIER ANALYSIS 


5-1 INTRODUCTION 

We now turn to the Fourier analysis of discrete and continuous functions. 
Unlike in Chapter 4, where we talked about the numerical evaluation of 
advanced mathematical functions rather than their interpretation, here we 
also try to understand the results that can be obtained from pocket 
calculator evaluation of the discrete Fourier transform. Such issues as the 
relationships between the discrete spectrum of discrete functions and the 
discrete spectrum of continuous functions are discussed. The aliasing 
concept is examined to aid those not familiar with it in understanding the 
spectrum of sampled-data functions. In a very real sense, the pocket 
calculator can be a valuable teaching aid in frequency-domain analysis in 
that it permits the quick evaluation of the spectra associated with discrete 
functions which (when sampled at sufficiently high frequency) approxi¬ 
mate continuous functions. For this reason, and to permit a quick evalua¬ 
tion of spectra in practical analysis, the formulas for 12-point and 24-point 
discrete Fourier spectra are given and the procedures for their quick 
evaluation on the pocket calculator are presented. Those using a simple 
four-function calculator will find the 12-point formula of particular inter¬ 
est, since no evaluation of sines or cosines is required for a determination 
of the discrete spectrum of a sequence of sampled values. 

Finally, we discuss function reconstruction, using pocket calculators 
with scientific keyboards, and procedures for Fourier series evaluation. 


5-2 THE FOURIER SERIES OF CONTINUOUS FUNCTIONS 

The Fourier series of a continuous periodic function whose period is L is 


139 
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given by 


F{x)=~f + 


j 2 r nx \ / Attx \ / 

a x cosl —— I + a 2 cosl —— I + a 3 cosl 


677.x: ' 

L 


+ 


+ 


b x sin 


2ttx 

L 


+ b 2 sin 


477 * 

L 


+ b 3 sin 


677* 

L 



(5-1) 


where the coefficients a k and b k are given by 


£ 

II 

i F(x)co i l 

^dx, 

(* = 0,1,2,- 


(5-2) 

11 

l 

) dx. 

<N 

II 

•) 

(5-3) 


This series has an infinite number of terms. There are an infinite number 
of coefficients defined by equations 5-2 and 5-3. Physically, the funda¬ 
mental and an infinity of its harmonics can determine an infinite sequence 
of coefficients for the sines and cosines in the series expansion of equation 
5-1. 

There are problems with the convergence of Fourier series, which are 
usually associated with special functions not frequently encountered in 
engineering analysis. Since the sines and cosines that make up the Fourier 
series are orthogonal functions, the coefficients of its components can be 
written in polar form. In this case the modulus of the radius vector can be 
plotted as a function of frequency, as well as the phase angle of the 
resultant. These form the amplitude and phase shift curves used by control 
systems analysts to examine the stability of feedback control processes. 

Finally, the power spectrum of a function approximated with a Fourier 
series can be prepared by plotting the square of the modulus of the Fourier 
coefficients in polar form. Since this is equivalent to summing the squares 
of the coefficients for the sine and cosine functions at the same frequency, 
it is apparent that the phase information is lost when the power spectrum is 
presented. 

Inspection of the Fourier series reveals that if the function F(x) is 
moved vertically along the ordinate, only the value of a 0 is changed. This 
can readily be seen by noting in equation 5-2 that when k = 0 the 
coefficient becomes the average value of F(x) on the interval zero to L. 
Note that if we were to set F,(x) = F(x)+ a 0 /2 in equation 5-1, the Fourier 
series of F\(j t) would have no average component (DC component). It is 
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also apparent that if F(x) is translated horizontally along the abscissa, the 
distribution of weight among the coefficients in the Fourier series changes. 
In particular, for any given frequency component, the distribution among 
the sine and cosine terms is different. However, the sum of their squares 
remains invariant. From this we see that the power spectrum of a Fourier 
series is invariant with respect to translation of the function/(x) along the 
abscissa (assuming that/(x) is periodic). 

What should be kept in mind about the Fourier series approximation of 
a function is that the function being approximated with the Fourier series 
is assumed to be periodic. Though it can have discontinuities over the 
period, the function must be periodic. For the student, what might seem 
counterintuitive at first is that this function defined over a finite interval in 
the domain of the real numbers has an infinite number of lines in its 
discrete spectrum defined over the infinite domain of the discrete frequen¬ 
cies. The reason for this lies in the nature of the Fourier series. The 
fundamental frequency component for the finite length record is set by the 
length of the record itself. Then the harmonics that make up the rest of the 
series are determined by pairs of sines and cosines that fit with multiple 
oscillations over the period of the periodic function, and there are an 
infinity of these sines and cosines. 


5-3 THE FOURIER SERIES OF DISCRETE FUNCTIONS 

Now let us consider a discrete function defined by a set of equally spaced 
discrete values of the function Y defined at equally spaced values of the 
domain of the independent variable. Following Hamming, we consider 
only an even number of points, 27V. In what follows, the 2TV sample points 
are 

n L 2L 3L (27V — 1)L 

°’ 2/V ' Yn ’ Yn 27V (Ml 

which can be written as 

x n=Ttr ("“O’ 1 . 2/v -') (5-5) 


The Fourier series expansion of an arbitrary function F(x) defined on the 
set of points x n can be written as 




*=1 L 


A k cos 


Ikirx 


+ B k sin 


Ik'l TX 

L 



2/Vttx ' 

L 


(5-6) 
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where 


1 1 / 2/c7 tx \ 

Z fl>„)cos [—f , (*-0,1,...,*) (5-7) 

« = 0 ' ' 


1 NT' ( 2k'TTX„ \ 

B k = jj Z — J-Z , (* = 0,1. N— \) (5-8) 

n = 0 ' ’ 

An interesting aspect of Fourier series is that the sine and cosine 
functions are not only defined and orthogonal over a continuous interval 
of the dependent variable but are also orthogonal on any set of equally 
spaced discrete points on the same interval. This is important for the 
numerical evaluation of the frequency components of the discrete func¬ 
tions in that we are usually only given samples of the function on the set of 
equally spaced points. One might expect that the coefficients for the 
continuous Fourier series could then be approximated using numerical 
integration. Although this, in fact, can be done, equations 5-7 and 5-8 show 
that the coefficients for the Fourier spectrum of a discrete function can be 
computed exactly without involving numerical integration approximation. 

It is apparent from equations 5-7 and 5-8 that this expansion will have 
only 2 N terms, rather than an infinity of terms, as is characteristic of the 
Fourier series approximation of continuous functions. Also, the frequency 
spectrum for the discrete Fourier transform will have only half as many 
lines as sampled values. Thus, if there are 10 sampled values, the power 
spectrum will have only five discrete lines. Clearly, a question that needs to 
be answered is, ‘what happened to the rest of the infinity of components of 
the spectrum of the continuous function F(x)7 Another relevant and 
equally important question is, how can the discrete spectrum be con¬ 
structed if we know the continuous spectrum and the number of sampled 
values? Conversely, given the discrete spectrum of the discrete function 
f(nhx), what can we tell about the discrete spectrum of the continuous 
function /(*)? 


5-4 THE RELATIONS BETWEEN THE FOURIER SERIES EXPANSION OF 
DISCRETE AND CONTINUOUS FUNCTIONS 

Only a little analysis will show that the spectral components of a discrete 
function are related to the spectral components of its continuous function 
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counterpart according to the relationship 

00 

^k~ a k + 2 { a 2Nm-k a 2Nm + k) (5‘9) 

m = 1 

Bk = bk+ 2 {^2Nm+ k ~ ^2Nm-k) (5'10) 

m= 1 

The constant term A 0 is given by 

^0 =a 0“*”2 2 a 2Nm (^'H) 

m = 1 

Let us examine equation 5-9. The first six terms of an even function 
discrete spectrum (say, for 2=10 seconds) when written out take the 
following form: 


Spectrum 

Component 

Discrete Function Spectrum from Continuous Function 

Spectrum 


^0 = a 0+ 2 ( a \0+ a 2Q Jr ~ ^30“*“ ^40 ‘ ‘ ‘) 

a u= lHz~ 

^ 1 = a i -|~ ( a 9~^ a \\) ( a l9~^ a 2\) i a 29+ a 3\) 

a u = 2Hz = 

A 2 = a 2 + (a 8 + a l2 ) + (a l8 + a 22 ) + (a 2S + 32 ) + ' * • 

«.=3Hz= 

A 3 = a 3 + ( a-j - 1 ~ a + (a X1 + a 13 ) + (a 21 + a 33 ) +••• 

a u=4Hz = 

A 4 = a 4 +{a 6 +a x ^) + (a ]6 + a 24 ) + (a 26 + a 34 ) + • • • 

2(X u = 5Hz = 

- 1-(^5 + ^ i5 ) + (^ 15 + a 25 ) + (n 25 + ^ 35 )+ 

Note: B k = 0 = 

b k for even functions. 


The factors of 2 at end points of the spectrum are due to the form of 
equation 5-6. Each of the discrete system coefficients when written out 
includes the effect of an infinite number of terms associated with the 
continuous spectrum. This somewhat stunning finding shows that any 
sizable power in the high-frequency components of the continuous- 
function spectrum will have the effect of those components appearing at 
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low frequencies in the discrete-function spectrum. In the example chosen 
where the 2 N is equal to 10 seconds, the lowest-frequency component in 
the discrete spectrum (besides the zero-frequency component) includes the 
amplitude of the 9 and 11 Hz frequency component in the Fourier series 
expansion of F(x), as well as the 19th and 21st, the 29th and 31st, 39th and 
41st, 49th and 51st, and so on. 

Another physical way to think of this is that, had we been sampling an 
11 Hz sine wave at 10 samples per second, the discrete function would 
have had spectral components at 1 Hz whose amplitude would be the same 
as that of the 11 Hz sine wave in the continuous spectrum. In general the 
frequencies present in the original continuous function F(x) are summed 
together as a result of the sampling operation. In other words, the mere 
operation of sampling a continuous function (i.e., a discrete function 
defined only at integer multiples of the sampling period) results in the 
folding of the frequency spectrum around the information frequency 
(sampling frequency divided by 2), resulting in the folding of the 
high-frequency components of the continuous-function spectrum into the 
low-frequency components of the discrete-function spectrum. This effect is 
called “aliasing” (i.e., 1 Hz is the alias of 9 Hz when sampled at 10 Hz). 
Note that once the sampling process has taken place, its effect on the 
continuous-function spectrum cannot be undone. In our example, only 
when a k = 0 for k >5 will A k = a k . If a k ^0 for k> 5, a k cannot be 
determined by examining A k \ Finally, note that the highest-frequency 
component is co^/2. 

These formulas provide the means for computing the discrete-function 
spectrum directly from the continuous-function spectrum if it is known. 
The CRC Standard Mathematical Tables has a sizable table of Fourier 
series representations of commonly encountered functions and can be used 
to compute the discrete-function spectrum according to equations 5-9, 
5-10, and 5-11 using a pocket calculator. 


5-5 THE NUMERICAL EVALUATION OF THE FOURIER COEFFICIENTS 


The coefficients of the discrete-function Fourier series expansion, pre¬ 
viously given by equations 5-7, and 5-8, are recopied here for convenience. 


IN — 1 


1 l 2kvx n \ 

A k = -h2j m,)cos —, (*=0,1. 

« = 0 ' ' 

IN- 1 , 

e * = a? 2 f w sin ( 

n = 0 ' 


N) 


(5-7) 


/ 2 A" 77.x: M 


(* = 0,1,...,/V— 1) 


(5-8) 
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These coefficients may be numerically evaluated, using recursion formulas. 
The procedure is as follows: 

Step 1 Prepare a table of values U m using the recursion relations 
^ = (2cos^jt/ ra _ 1 -t/ m _ 2 +F(2iV-m) for m = 2,3,4,...,2A'-1 

(5-17) 

where U 0 = 0 and £/, = 7(27V—1) 

Step 2 Evaluate the coefficients of the cosine terms in the series, using the 
equation 

^=^{(cos^)t/ 2 ^,-l/ 2 „- 2 +F(0)j (5-18) 

Step 3 Compute the coefficients for the sine terms in the series expan¬ 
sion, using the recursion formula 

B *~i/( liaS ir) u ™-' (5_19) 

While evaluation according to these formulas takes longer than does the 
Cooley-Tukey fast Fourier transform algorithm, for the low-order Fourier 
series analysis that can be conveniently done on the pocket calculator this 
recursion formula method involves only slightly fewer operations than the 
Cooley-Tukey algorithm. Moreover, the use of recursion formulas in the 
numerical evaluation of functions is efficiently done on the pocket cal¬ 
culator. 

Hamming presents a convenient 12-point formula for Fourier analysis in 
his book Numerical Methods for Scientists and Engineers. First, the table of 
discrete values of the function F is written in an array (A 1): 

F( 0) F( 1) F( 2) F( 3) F( 4) F( 5) F( 6) 

7(11) F( 10) F( 9) 7(8) F( 7) 

From this array we can compute a sequence of 5’s and 7”s by adding and 
subtracting, respectively, the two lines in the array A 1 to form an array 
(A 2) of S’s and 7’s: 

Sums^ 5(0) 5( 1) 5(2) 5(3) 5(4) 5(5) 5(6) 1 

Differences-^ T(l) T( 2) T{ 3) 7(4) 7(5) j 




146 


Fourier Analysis 


Then, rewriting array (A 2) as 

5(0) 5(1) 5(2) 5(3) T{ 1) T( 2) T( 3) | ^ 

5(6) 5(5) 5(4) T(5) T( 4) J 

we form array (/14) of t/’s, F’s, P’s, and £>’s by adding and subtracting, 
respectively, the second line from the first line in array A3: 

Sums—> t/(0) U( 1) </(2) £/(3) P(l) P( 2) P(3) 1 4 

Differences^ K(0) F(l) F(2) £0) Q( 2) J 

The coefficients associated with the six discrete frequencies that make 
up the Fourier series representation of the 12-point discrete function can 
now be developed. First compute 


Then 


V(2) 

« ] = ym + -~ 

ct 2 = U(0)+ (7(3) 
a 3 = £/(!)+ (7(2) 

« 5 =t/(0)-{/(3) 

« 6 =(/(l)-l7(2) 
«,= K(0)- K(2) 



+ p(V 


fi 2 ~P(l)-P(3) 

ft=^y-(e(D+e(2)) 



^0=H«2+«3) 

^ 1 = i(«i + « 4 ) B 1 = i(fii + Ps) 
A 2~ ~2 j B 2~z(^3 

A 3 = ^ a l B 3 = i@2 

A 4=\[k2~-^ B 4~Z^4 

A 5 = ^( a l~ a 4) B 5 = 6 ( fi\~ fis) 

A 6=i( a 5~ a 6) 
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These coefficients are associated with the Fourier series 

n- 1 

( . 

. cos 




k= 1 


L 


L ) 


+ cos 


IttNx 

L 


5-6 SUMMARY 

A number of important observations have been made on the Fourier 
expansion of discrete functions. Take, for example, the 12-point Fourier 
coefficient formulas. The 12 values of the discrete function result in a 
discrete spectrum with only six frequency components. In general, 2N 
values of a discrete function will result in a spectrum with only N spectral 
components. This generally holds true, reflecting the rule of thumb that 
data must be sampled at at least twice the highest frequency of interest for 
the coefficients of the spectral components to be determined at the 
frequency of interest. 

The second observation deals with the physical characteristics of 
sampled data. Note that once a function is sampled the spectrum of the 
sequence of sampled functions has no frequency components greater than 
half the sampling frequency. What, then, happens to the high frequency 
components of a sampled continuous function? They are folded down into 
the low-frequency components of the discrete spectrum. They are summed 
with the low-frequency components. In this sense the high-frequency 
components are not actually rejected by the sampling process, but they are 
folded into the low-frequency components of the discrete function 
spectrum, resulting in distortion of the low-frequency components that 
made up the original continuous-function frequency spectrum. Thus, 
although the sampling operation does not result in high-frequency com¬ 
ponents in the sampled-functions spectrum, the true continuous-function 
spectrum can be much distorted by the sampling process in the low 
frequencies. 


5-7 REFERENCES 

For this chapter refer to Richard Hamming’s Numerical Methods for 
Scientists and Engineers (McGraw-Hill, 1973), Chapter 1 and Chapters 31 
through 34. 

Example 5-1 Compute the coefficients in the Fourier series expansions of 
a continuous periodic triangular wave function. Then use these coefficients 
to compute the discrete function spectrum by way of equations 5-9 through 
5-11. 
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The continuous periodic triangular wave form is shown in Figure 5-1. It 
is apparent that 


/(') = -K-t) 

and 

The first equation shows that f(t ) is an odd function and thus only the sine 
components are involved in the Fourier series approximation of f{t). The 
second equation shows that only the odd harmonics are involved in the 
series. Furthermore, when two symmetry conditions exist, it is necessary to 
integrate only over one-quarter of the period of the function to determine 
the Fourier coefficients (an interesting property for the reader to dem¬ 
onstrate to himself). It follows, that 




Figure 5-1 Continuous periodic triangular waveform. 
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-8 A 

IT^TT^ 


«= 1,5,9,- 


= 3,7,11,- 


The first 15 continuous function spectrum components are tabulated 
below. 


bx 2 = 0 


, 8 A 

13 169t7 2 


6q = ^M = 0 


^io~ 0 


The discrete spectrum components are developed by using equation 5-10 
for the case 2N = 10: 

B\ — b x + {b u — b 9 ) + (& 2 i — b ]9 ) + (£31 ~ b 29 ) + • • • 

B 2 — b 2 + (& 12 — b%) T (^22 ~ ^18) (^32 ~~ ^28) 

^3 = ^3 + (^13 ~ ^7) + (^23 _ ^17) + (^33 ~ ^27) ' 

B 4 = b 4 + (^14 — ^ 6 ) + (4*24~ ^16) "f" (^34 — b 2 ^} + • • • 

From the table of continuous function spectrum components we see that 


77 


= 0.9793900 


B 2 s "^T [(0 + 0) - 0] = 0 


9 + 169 


-1] =0.0847859(^4 

/ \ 77 


^ 4 ^-^ f [(0 + 0 )-( 0 )] = 0 
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The following table shows the discrete function spectrum elements and the 
continuous function spectrum elements side by side for easy comparison. 
The difference is the result of the aliasing phenomenon. 

Continuous Function Discrete Function 

Spectrum Elements Spectrum Elements 

= B, = 0.9773900 




Example 5-2 Compute the Fourier coefficients for the Fourier series 
approximation of the discrete function 

y n = sin(nwT) 

where to= 1 Hz. This 12-point discrete function is tabulated below. 


n 

nT 

(seconds) 

ncoT 

(degrees) 

Sin(rtWi 

0 

0 

0 

0.00 

1 

0.1 

36 

0.59 

2 

0.2 

72 

0.95 

3 

0.3 

108 

0.95 

4 

0.4 

144 

0.59 

5 

0.5 

180 

0.00 

6 

0.6 

216 

-0.59 

7 

0.7 

252 

-0.95 

8 

p 

oo 

288 

-0.95 

9 

0.9 

324 

-0.59 

10 

1.0 

360 

0.00 

11 

1.1 

396 

+ 0.59 


A closer examination of the table reveals that the sine function is 
tabulated over the interval to 1.2 seconds, while the function is periodic on 
the interval 1 second. Clearly, the coefficients that we generate using the 
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12-point formulas apply to Fourier series approximations of the tabulated 
functions exactly as shown (with a period of 1.2 seconds, not 1 second), 
and not the coefficients for the Fourier series approximation of a pure 1 
Hz sine wave. The purpose for selecting this unusual problem is that it 
illustrates not only the use of the 12-point formula, but also the effect of 
one of the practical problems associated with sampling periodic functions. 
The reality of sampling functions from experiments is that the functions 
are often not exactly periodic or, if they are, the period is not known 
precisely and some approximation of the period must be made. This 
example might be considered the result of an experiment where an estimate 
of the period of the function being sampled is made to be 1.2 seconds, 
where in reality the periodic function repeats on the interval 1 second. 
Following the 12-point Fourier analysis procedure, the discrete function 
12-point Fourier coefficients are generated from the arrays as tabulated in 
Table 5-1. The array numbers in the table correspond to the array numbers 
in the text and are shown here for the sake of convenience. The numerical 
evaluation of the 12-point Fourier series coefficients are summarized in 


Table 5-1 Discrete Function 12-point Fourier Series Coefficient Generation Arrays 


r (0)-*F(6) 

0.00 

0.59 

0.95 

0.95 

0.59 

0.00 

-0.59 


(A 1) 

r (12)«- F(7) 


0.59 

0.00 

-0.59 

-0.95 

-0.95 




idd 

0.00 

1.18 

0.95 

0.36 

-0.36 

-0.95 

-0.59 


(A2) 

ubtract 


0.00 

0.95 

1.54 

1.54 

0.95 





S(0) 

SO) 

S{ 2) 

5(3) 

5(4) 

5(5) 

5(6) 


(A 2) 



T(l) 

T(2) 

T(3) 

n 4) 

T( 5) 




'(0)->5( 3) 

0.00 

1.18 

0.95 

0.36 

0.00 

0.95 

1.54 

T(\)->T(3) 

M3) 

X6)^S(4) 

-0.59 

-0.95 

-0.36 


0.95 

1.54 


T(5)«— T(4) 


i.dd 

-0.59 

0.23 

0.59 

0.36 

0.95 

2.49 

1.54 


(A4) 

ubtract 

0.59 

2.13 

1.31 


-0.95 

-0.59 





(U(0) 

U( 1) 

U( 2) 

U( 3) 

P{ 1) 

P( 2) 

P( 3) 


(A4) 


m 

V(\) 

V(2) 


<2(0 

0(2) 
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Table 5-2. The ^-coefficients are associated with the cosine components 
and the 5-coefficients are associated with the sine components. Also, the 
table shows a check of the initial conditions. At / = 0, the discrete function 
starts at 0. Therefore, the sum of the cosine amplitude coefficients should 
equal 0, as they do. 

Though we could discuss the tail effects by examining the individual 
elements of the series expansion, it is more convenient to use the power 
spectrum or amplitude spectrum as a means of discussing this 
phenomenon. The 12-point spectrum calculations are tabulated in Table 
5-3. First note that the DC component of the spectrum is given by P 0 . This 
indicates that the average effect of the “tail” of our irregular periodic 
coefficient is to bias the otherwise 0-DC coefficient to the level 0.049. 
Second, the lowest-frequency component (the fundamental frequency 
equal to 1/1.2 equal to 0.83333...) contains the greatest amount of power 
of all of the harmonics. Clearly, this is so because it is the closest frequency 
to the 1 Hz periodic function that we have sampled. The power in the 
next-highest harmonic is approximately one-tenth that of the fundamental. 

Table 5-2 12-Point Fourier Series Coefficient Calculations 

Note that 
at t = 0, 
2^i= 0, 
as it should 


5, = I (0.475 + 1.54 + 0.866 X 2.489) = 0.695 

B 2 = — ~ (- 0.95 - 0.59) = - 0.2223 

B 3 = 1(0.95- 1.54) = 0.098 

B 4 = (- 0.95 + 0.59) = - 0.05196 

B 5 m 1 (0.475 + 1.54 - 0.866 X 2.49) = - 0.0436 



f 2A 0 = 1 ( - 0.59 + 0.36 + 0.23 + 0.59) = 0.098 
A , = 1 (0.59 + 0.655 + 0.866 X 2.13) = 0.5149 
A 2 =U- 0.59 — 0.36-. 18)= -0.188 
A 3 = ^( — .72)= -0.12 
A 4 = ^ (0.59+ 0.36 —.41)= —0.107 
A 5 = ^ (0.59 + .655 - 0.866 X 2.13) = - 0.0999 
2A 6 — -h( —0.59 —0.36 —0.23 + 0.59) = -0.098 


Sine 

components 
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Table 5-3 

12-Point Spectrum Calculations 




Power Spectrum 

Amplitude 

wHz 

(rounded) 

Spectrum 

0 

p o = ^/4 =0.00 

V> 0 

= 0.049 

0.833 

P, = Af+B?= 0.75 

V7h 

= 0.865 

1.666 

P 2 = A 2 +B 2 = 0.08 

a fp 2 

= 0.291 

2.499 

P 3 = A 2 +B 2 = 0.02 

V7 >; 

= 0.155 

3.333 

P 4 = Aj+B 2 = 0.01 

V7>4 

= 0.119 

4.166 

P 5 = A 2 + B 2 =0.01 

vt 5 

= 0.109 

5.499 

P 6 = A 2 /4 =0.00 

V^6 

= 0.049 


Had we taken the 12 sample points equally distributed over the periodic 
sine wave function, we would have found a single harmonic component at 
1 Hz and the rest of the components would have been zero or very small, 
depending only on truncation error as related to the number of terms 
carried in the pocket calculator analysis. Here we see the effect of the 
“tail” is to affect the DC level and spread the power in the 1 Hz sine 
function over higher-frequency harmonics. The reason for this is that the 
high-frequency components are required to take care of the discontinuous 
end effects associated with the “tail” in our sampled periodic discrete 
function. Specifically, this “tail” is associated with the jump discontinuity 
in going from +0.59 at n = 11 to 0 at n— 12 for the example function that 
we have chosen to analyze. Hopefully this example will interest the 
students who read this book in further readings in practical Fourier 
analysis, on which there is an extensive literature. 




CHAPTER 6 


NUMERICAL INTEGRATION 


6-1 INTRODUCTION 

There are basically two types of integral with which we are concerned in 
this chapter: the definite integral and the indefinite integral. The definite 
integral is given by the formula 

y(b)=y(a)+ f f(x)dx (6-1) 

*a 

and the indefinite integral is defined by 

y(x)=y(a)+ f f(t)dt (6-2) 

The definite integral is characterized by computing the area under the 
curve of a bounded function; the indefinite integral can be thought of as 
computing the antiderivative of the integrand and thus generating the 
sequence of values of a function. We study definite integrals from the 
standpoint of quadrature—that is, for computing the area under a curve. 
We study indefinite integrals from the standpoint of integrating differential 
equations. Our first concern here is the definite integral. 


6-2 DEFINITE INTEGRATION 

Computing the area under an arbitrary curve is usually based on the 
concept of analytic substitution. The idea is to use a known function 
whose definite integral is easily evaluated to substitute for the arbitrary 
function to be integrated. The integration is actually performed on the 
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substitute function and attributed to the integral of the arbitrary function 
to the degree to which it approximates the latter. In classical mathematics 
the substitute functions to be integrated are usually polynomials. The 
polynomial is then analytically integrated and, insofar as the polynomial 
approximates the continuous function, the integral is attributed to the 
integral of the arbitrary function. When the integrand is a polynomial of 
degree n and the approximating function is also a polynomial of degree n, 
the formula can be made exact by appropriately selecting the coefficients 
in the integration formula. 

The process of analytic substitution or of other means of approximating 
definite and indefinite integrals is so fascinating that virtually every 
numerical analyst finds new ways to rederive many of the classical formu¬ 
las and a few others as well. Though one is tempted to present the most 
sophisticated integration methods, the focus here remains on classical 
developments, which are straightforward and easy to apply on the pocket 
calculator. The reader should be aware, however, of the tremendous 
quantity of good mathematics in numerical integration developed in the 
last 20 years. This is due to numerical calculations being done on the 
digital computer and to the use of numerical analysis in sophisticated 
technology problems in varied areas. Structures, communications systems, 
control systems, design of aircraft, and the design of chemical plants are 
areas where the simulation of systems with widely separated eigenvalues 
and the numerical integration of functions that are almost neutrally stable 
(at large integration step size), have produced new integration concepts 
based on the technology to which they were being applied. Structural 
dynamicists have developed special numerical integration formulas for 
integrating their “stiff differential equations.” Controls analysts have pro¬ 
duced such formulas based solely on frequency-domain considerations. 
And special single-step real-time numerical integration formulas have 
been developed by simulation scientists. 

These problems can be encountered on the pocket calculator, especially 
the programmable pocket calculator. Here, however, we focus on the more 
classical formulas, which have fairly general and broad applications to the 
more analytically tractable functions. Furthermore, there is a vast body of 
literature on these classical methods for further reference, should it be 
required. 


Trapezoidal Integration 

If we approximate the function f(x ) on a bounded interval a < X < b by a 
line through the end points, we can write the equation for the approximat- 
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ing function over the interval as 


y(x)—f(a) + 


m-m 


y(x)= 


(b-x)f(a) + (x-a)f(b) 
b — a 


Integrating equation 6-3, we find 



mimy b - a) 


(6-3) 


(6-4) 


Equation 6-4 computes the area under the straight-line interpolation be¬ 
tween the two end points. This is called trapezoidal integration because 
this area is enclosed by a trapezoid formed by lines connecting the end 
points, the abscissa, and the vertical lines connecting the end points to the 
abscissa. If the interval is large, the trapezoidal approximation can lead to 
large numerical integration error. This is resolved by a repeated applica¬ 
tion of the trapezoidal rule on smaller intervals of the dependent variable. 
When this is done for equally spaced intervals, Ax, trapezoidal integration 
takes the form 


r b ff(a) f(b)\ 

J f(x)dx = Ax I ——— hf(a + Ax) + f (a + 2Ak) + • * • -f —-— j (6-5) 

Trapezoidal integration, though not the simplest one to derive or com¬ 
pute (Euler, modified Euler, or rectangular integration are simpler con¬ 
cepts) and its error formula does not give the least error for the fewest 
computations, is straightforward to apply on the pocket calculator and is 
easily remembered. As we move to integration formulas involving mid¬ 
values and their derivatives, estimates of a roundoff and truncation error, 
and adjustments of phase shift and amplitude, we retreat further from 
simple visualizations of the integration process and must increasingly rely 
on the rationale for their development to be assured of their applicability 
to a problem. Ultimately, analytical integration is compared with the 
approximate numerical integration to evaluate the difference between 
several methods of integration for a particular problem. Clearly, this is an 
overkill for back-of-the-envelope engineering analysis or analysis on the 
pocket calculator intended simply to compute the area under the curve of a 
given function. If trapezoidal integration is sufficiently accurate, and the 
number of intervals needed to obtain the desired accuracy is not prohibi¬ 
tive, it is very useful for pocket calculator analysis. 
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6-3 ERROR IN TRAPEZOIDAL INTEGRATION 


We do not here aim to explore the derivation of integration or error 
formulas—merely to tabulate the commonly used ones and put them in a 
form that is immediately useful for the pocket calculator. Nevertheless, it is 
instructive to examine the error of a simple integration formula as a means 
for understanding the error equations given for the more sophisticated 
integration formulas. Following Hamming, then, we examine the trunca¬ 
tion error in the trapezoidal integration algorithm by substituting a Taylor 
series expansion into the integration formula. By comparing both sides of 
the results, we can then determine the error associated with the analytic 
substitution process in the numerical integration. Specifically, if we write 
the integrand in its Taylor series expanded form as 

f(x)=f(a) + (x-a)f'(a)+ - - ^ f"(a) +• • • (6-6) 

and substitute this into both sides of the trapezoidal integration formula, 
we find that, on integration, the left side becomes 


(b-a) 

1 ! 


/(*) + 


(b-a) 

2 !~ 


/'(*) + 


(b-af 

3! 


■/»+■*• 


The right side becomes 


(6-7) 


Ax 

2 


f(a) + (b-a)f(a) + 


(b-af 

2 


■/"(«)+•■•+/(*) 


+ e 


(6-8) 


where Ax = (b — a). After canceling like terms on both sides we can derive 
the truncation error formula: 


(b — a) 

c+—r—f"(a) + 




(*-") f"( \ 

= 3! f ^ 

(6-9) 

(b-a)\ 

5 /(«)*•• 

(6-10) 


If we assume the largest part of the error term to be given by the first term 
in its series expansion, we can write 


(b~a) f"(a) 

12 


( 6 - 11 ) 
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or, more generally, 

£ _ (b-a) 3 f"(0) 


(a < 9 < b) 


( 6 - 12 ) 


If, however, the function has contributions to the error formula that are 
large in the higher-order terms, this error formula does not apply. It is 
applicable for many of the practical engineering problems, and thus is 
generally quoted as the error associated with trapezoidal integration. 

The specific error formula for trapezoidal integration is less important 
here than is the method by which it is derived. We used the Taylor series 
expansion for the integrand in order to derive a Taylor series truncation 
term for “the area.” Another alternative would have been to use a Fourier 
series representation of the function to determine the truncation in the 
frequency domain. Another approximating polynomial could have been 
the Chebyshev polynomial approximation of/(x), which would have given 
another type of truncated polynomial approximation error formula. While 
the interpretation of the results of each error formula is different, the 
magnitude of the error is not. The error is a characteristic of the integra¬ 
tion formula, rather than the approximating polynomial used in the error 
formula evaluation. 

Figure 6-1 shows that for concave-up type of functions trapezoidal 
integration is always slightly more than the curve it is trying to approxi¬ 
mate; for concave-down type of functions it is slightly less. Thus it seems 



Figure 6-1 Truncation error in trapezoidal integration. 
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reasonable to expect, when integrating “wavy” functions, the intervals to 
be set up so that, at a minimum, the eyeball approximation of the errors on 
one interval may have a chance to cancel the errors on the other interval. 
We can extend the error formula for simple trapezoidal integration to the 
composite formula by similar reasoning: 


(b — a) Ax' 

12 


■f"(0), (a<9<b) 


(6-13) 


Writing error formulas such as equations 6-12 and 6-13 is, of course, 
easier than evaluating them meaningfully. One approach is to find the 
second derivative of the function being considered, compute the minimum 
error and the maximum error, and divide by 2 to obtain the average error 
of the integration over the interval. Another approach is to take the 
worst-case error. A great number of other alternatives also exist. The 
question is, what is the criterion for numerically evaluating the error? 
Unfortunately, there is no easy answer to this question. From an engineer¬ 
ing viewpoint, the error defined by equation 6-9 perhaps has more meaning 
than those most often quoted in numerical analysis books. In this sense the 
process of deriving the error formula is the more fundamental issue in that 
the engineer or scientist can compute his own error formula suited to his 
specific problem. 

Another aspect of the numerical error formulas associated with integra¬ 
tion formulas is that they are absolute errors, whereas the error of interest 
is usually relative error. Again, the author has no easy solution of the 
problem of deriving relative error formulas for numerical analysis. The 
difficulty is pointed out here to warn the student or first-time numerical 
analyst about error formulas in general. Preferably he should derive his 
own formula for a particular problem being numerically analyzed. An 
estimate of the error in a numerical approximation over an analytical 
calculation must be made, but its interpretation is not straightforward and 
the result cannot be casually given from questionable error formulas. 


6-4 MIDPOINT INTEGRATION 

Midpoint integration uses the midvalue of an interval and the derivative of 
the integrand evaluated at the midvalue to define the slope at the midpoint 
of the interval, again forming a trapezoid whose area under the curve 
approximates that of a function to be considered. 

The midpoint integration formula as developed by Hamming is easy to 
follow and nicely introduces the concept of a general approach to deriving 
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polynomial approximations for analytic substitution. We are to derive an 
integration formula of the form 


jj(x)dx -*-■/( + *J' (*¥) (6-14) 

Hamming’s weighting coefficients can be easily derived by noting that we 
first require this formula to be exact for f(x) = 1. This gives 

b — a = w { (6-15) 


We also require that this formula be exact for f(x) = x, which leads to 


b 2 -a 2 


2 



a+ b 
2 


+ W2 


(6-16) 


We can determine the two Hamming’s coefficients by solving these equa¬ 
tions simultaneously: 

I b 2 — a 2 \ (b-a)(a + b) (b 2 -a 2 \ lb 2 -a 2 \ n 

)—2—i~H~r 0 

w l = b — a (6-17) 


We therefore find the midvalue integration formula to be 

fj(x)dx~(b-a)f{^-} (6-18) 

We see that midpoint integration developed in this manner results in 
rectangular integration. That is, the area formed by the rectangle sampled 
at the midvalue is identically equal to the area under the tangent line at the 
midvalue of the interval. At first it might seem paradoxical that the 
low-order rectangular integration could be as good as trapezoidal in¬ 
tegration—that formulas based on a single point of / could be as accurate 
as a two-point trapezoidal formula. In fact, rectangular integration can be 
made as precise as desired if the sample point on a bounded interval can 
be varied until the mean value theorem of calculus is satisfied. Once again , 
rectangular integration can be made as precise as the true integral pro¬ 
vided that the point at which the function is sampled on the interval can be 
determined, so that the rectangle formed by the sampled value and the 
lines connecting the end points of the function on the interval and the 
abscissa itself have the same area as that under the function bounded on 
the interval. This fact is reflected by equation 6-18. 
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Again, to find the truncation error term, we use the Taylor series: 
(x-a) (x — af 

Ax) =/(«)++ -jr -/*(«)+• • • 
where, upon substituting in both sides of equation 6-18. we find 


(6-19) 




( b-a ) 


( 6 - 20 ) 


This is usually simplified to 


(b-a) f"(a ) 
20 


( 6 - 21 ) 


or, more generally, for a < 9 < b 


(b — aff"(6) 

£ ~ 20 

Comparing equations 6-21 and 6-12, we see that midpoint rectangular 
integration is more accurate than endpoint trapezoidal integration even 
though the rectangular integration is based on knowing the function at 
only one point while the trapezoidal rule "of integration requires the 
knowledge of the function at two end points. 

Extending the midpoint integration formula, we find, as in the 
trapezoidal formula, the composite midpoint integration formula to be of 
the form 

f f(x)dx 

J a 


= Ax 



+/(«+ ^Y^) + f( a 


+ 


5Ax 

2 




+ e 


where its error formula is given by 

, /"(g), ( a<e<b) (6-22) 

Note also that extended trapezoidal integration can be modified to 
include end points outside the interval [a,b]. The modified trapezoidal rule 
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is given by 


j f(x)dx = Ax 


2 


+ f(a + Ax) + f(a + 2Ax) + • • • + 


m 

2 


+ ^[-f(a-bx)+f(a + Ax)+f(b-Ax)-f(b + iLx)] (6-23) 

where the error associated with modified trapezoidal integration is given 
by 


\\(b-a)Ax 4 

£= -720- (a + Ax)<9<(b + Ax) (6-24) 

which is usually much more accurate than extended midpoint integration 
with only slightly more work. 


Other Popular Definite Integration Formulas 

Simpson’s rule, perhaps the most commonly used integration formula, is 
given by 


r 2A-X A y. 

f f(x)dx= %^(/o + 4/i +/ 2 ) (6-25) 

Its associated error formula is given by 

( o<e<2Ax ) 

Simpson’s rule has the nice property that it integrates cubics exactly even 
though it samples only three points of the integrand and in addition has 
very small error terms when Ax is less than 1 and on the order of one-half. 

Simpson’s rule can also be extended (on an even number of intervals) 
according to the formula 


/* "^2/f A v* 

J xo /M dx=f (/„ + 4/, + 2 f 2 + 4/, + 2/ 4 + ■ ■ ■ +/ 2 „) (6-26) 


Its error formula is given by 


nkx 5 




(.v 0 < 0 < x 0 +2nAx) 


€ = 


90 


(6-27) 
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Perhaps the simplest extended integration formula is the Euler-Maclaurin 
formula: 


j x ”f( x )dx — +/i +f 2 +f .3 + 


4 


B^x 2 

2! 



2k\ 



It has the error formula 


(6-28) 


e 2 k = 


enB 2k+2 Ax (2k + 3) 
(2k + 2)\ 


max \f(x) 2k+2 \}, (6-29) 

Xq -X X n ) 


Here B 2k is a Bernoulli number. 

The three-eights rule for definite integration is given by 

r x 3 3 A v 

jj{x)dx=XXL Uo+ 3/, + 3/ 2 +/ 3 ) (6-30) 

Its associated error formula is 

3 \ v 5 

*o<0<*3 (6-31) 

Two types of formulas are used for quadrature when many sample 
points of the integrand are known: Bode’s definite integral formulas and 
the Newton-Cotes formulas of the open type. Bode’s rules for quadrature 
are shown in Table 6-1, and the Newton-Cotes formulas are tabulated in 
Table 6-2. 

The high-order formulas, such as the Newton-Cotes and Bode’s formu¬ 
las, can have some very undesirable properties for large n. For some 
analytic and discrete functions the sequence of the integrals of the in¬ 
terpolating polynomials does not converge toward the integral of the 
function. Also, the coefficients in these integration formulas are large and 
of alternating sign, which is undesirable from the standpoint of propagat¬ 
ing roundoff error. It is primarily for these reasons that the Newton-Cotes 
formulas are rarely used for high values of n. For lower values of n they 
can be simplified to some other well-known formula, such as the previously 
discussed trapezoidal formula and Simpson’s rule. Although Bode’s rule 
gets around the alternating signs associated with the Newton-Cotes formu¬ 
las, it too has convergence problems for certain occasionally encountered 
functions. Suffice it to say that the extended trapezoidal integration with 
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Table 6-1 Bode’s Definite Integration Formulas for Integrating Functions Whose 
End Points Are Known 


Integration Formulas 


Error Formulas 


jH 4 /(x)<k= ^(7/ 0 + 32/ 1 + 12/ 2 + 32/ 3 + 7/ 4 ) 


8A* 7 / VI (0) 

945 


fj(x) dx = ^ (19/ 0 + 75/, + 50/ 2 + 50/ 3 + 75/ 4 + 19/ 5 ) 
f*‘f(x)dx= ^(4l/ 0 + 2l6/, +27/ 2 + 272/ 3 + 27/ 4 + 216/ 5 + 41/ 6 ) 


275Ax 7 / v1 (0) 

12096 

9Ax 9 / VII, (0) 

1400 


jf * 7 /(v) ^ (75 l/o + 3577/, + 1323/ 2 + 2989/3 + 2989/ 4 

+ 1323/ 5 + 3577/ fi + 751/ v ) 

Qf(x)dx = (989/o+5888/,-928/ 2 + 10496/ 3 -4540/ 4 

+ 10496/ 5 — 928/ 6 + 5888/ 7 + 989/ 8 ) 


8183Ajc 9 /$ n 
518400 “ 


2368 Ax'/* 
467775 


end effect modification has high accuracy, does not propagate roundoff, 
requires only a reasonable amount of work in computing the integral of 
any function, and is thus recommended for analysis on the pocket calcula¬ 
tor. 


6-5 INDEFINITE NUMERICAL INTEGRATION 

Indefinite numerical integration is the numerical method for solving 
differential equations. Given the equation 

dv 

-fc-Axj) (6-32) 

we would usually solve it by indefinite integration as follows: 


T=7o+ f 

J x 0 


(6-33) 
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Table 6-2 Newton-Cotes' Definite Integration Formulas for Integrating Functions 
Whose End Points Are Undefined or Unknown or Are Singular Points 


Integration Formulas 

Error Formulas 

0 W rfx=M£(/ I+ / 2 ) 




j* ! f(x)dx= —(Ufy +/ 2 +/ 3 + 11/ 4 ) 

95Ax 5 /*iv/ /) \ 

144 j {6) 

/7«* = 1 Vl - 14/2 + 26/3 

-14/4+ll/ 5 ) 

41Ax 7 

140 / (0) 

jH/Wrf* = (611/, - 453/2 + 562/, 

+ 562/ 4 - 453/ 5 + 61 1/ 6 ) 

5257 Ax 7 ,-vi 

8640 J(0) 

fj(x) dx = ^ (460/! - 954/2 + 2196/3 - 2459/4 

+ 2196/ 5 —954/ 6 + 460/ v ) 

3956A* 9 r vii 

14175 


It is apparent that the solution of the differential equation depends on its 
own evaluation of the integral. This is precisely the chief problem in 
indefinite integration; that is, indefinite integrals are in an implicit form. 
Note that an explicit indefinite integral takes the form 

T=7o+ f f(*)dt (6-34) 

Jx Q 

which is a special case of the differential equation 
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Clearly, this type of numerical integration can be performed analytically, 
hence is not of concern here. 

The simplest indefinite numerical integration algorithm is Euler’s in¬ 
tegration formula: 


y »+' =} ''’ +Ax ('^) ( 6-36 ) 

Here we see that a new estimate (y„ + 1 ) of y is based on the old estimate 
(y n ) and its derivative [(dy / dx) n \. The derivative is usually calculated 
directly from the differential equation once y is estimated. Since the new 
estimate y n + l is based on the old estimate^ and the old value y n it is 
clearly an “open-loop” process where the new value y n is based on an 
extrapolation from previously known data and thus is subject to extrapola¬ 
tion errors. The process of determining new values of y is really a simple 
extension of determining the direction field associated with a solution of a 
differential equation. In general, the approach is to start at some initial 
condition (x 0 ,y 0 ) and calculate the slope, using the differential equation: 


y'o=f(Uy 0 ) 

One then moves an interval A* in the direction of the slope to a second 
point, which we now regard as the new initial point, and repeat the process 
iteratively. If small enough steps are taken we can reasonably hope that the 
sequence of solution values given by this procedure will lie close to the 
solution of the differential equation. In general, all of the elements of 
solving differential equations using numerical indefinite integration are 
present here. A table of the values of x,y,y', and Ay must be computed at 
each step in the numerical integration process. Also, the problem must be 
defined by specifying not only the differential equation and its initial 
conditions, but also the interval over which it is desired to solve the 
equation. It is then possible to select a convenient integration interval, and 
an integration formula that is accurate for that interval. For example, 


with initial conditions y = 0, x = 0. When integrated with Euler’s integration 
formula 


requires a specification of the interval Ax. The simplest approach is to 
experimentally determine the Ax that will accurately (as judged by the 
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analyst) integrate the differential equation. Consider solutions of this 
differential equation with Ax = 0.05,0.1,0.2, and 0.3. The results are tabu¬ 
lated in Table 6-3. A comparison of the numerically integrated solutions 
with the exact solution shows that the sensitivity of the solutions accuracy 
depends strongly on the integration step size. This is true, in general, for all 
numerical integrators when the integration step size is even a reasonable 
fraction of the “response time” of the differential equation.* 


Table 6-3 Solution of dy / dx- 

= e y — x 2 



X 

Exact 

Solution 

y 


Euler Integrated Solution 


Ax = 0.05 

Ax = 0.10 

Ax = 0.2 

> 

II 

© 

Lj 

0.0 

0.0 





0.1 

0.09498 

0.09694 

0.09900 

— 

— 

0.2 

0.17977 

0.18261 

0.18557 

0.19200 

— 

0.3 

0.25389 

0.25672 

0.25964 

— 

0.27300 

0.4 

0.31667 

0.31872 

0.32077 

0.32506 

— 

0.5 

0.36731 

0.36786 

0.36833 

— 

— 

0.6 

0.40488 

0.40329 

0.40152 

0.39756 

0.39333 

0.7 

0.42839 

0.42407 

0.41942 

— 

— 

0.8 

0.43686 

0.42923 

0.42119 

0.40395 

— 

0.9 

0.42929 

0.41782 

0.40582 

— 

0.35277 

1.0 

0.40477 

0.38895 

0.37264 

0.33749 

— 

A disadvantage of 

the Euler 

method is that it introduces systematic 

phase shift or lag (extrapolation) 

errors at each step. The procedure can be 

modified (modified Euler integration) to give better results- 

-that is, greater 

accuracy for essentially the same method and the same amount of work. 


6-6 THE MODIFIED EULER INDEFINITE INTEGRATION METHOD 


An alternative to introducing lag into the calculation is to arrange the 
sampling so that the integrand is sampled not at the end point of the 
interval over which the integration is taking place but at the midpoint. This 
is similar to the development of the midpoint trapezoidal formula devel¬ 
oped in Section 6-4. The task is to perform the integral 

f n+ 'y'(x)dx (6-37) 

J X„-\ 


approximately the time required to move from one equilibrium condition to another 



168 


Numerical Integration 


using the midpoint formula (see Section 6-4). We wish to predict the next 
value of y based on present and past values of the independent variable. 
The midvalue prediction leads to 

/’»+i=>’„-i+ 2A ^» ( 6 - 38 ) 

Using this predicted value, we can now compute the slope at the predicted 
solution point, by way of the differential equation, 

ti+l=A x «+l’Pn+t) ( 6 ‘ 39 ) 

and then apply the trapezoidal rule developed previously to update the 
estimate of the predicted solution point: 

yn+\=y n +^[-(P'n+\+y'n) (6-40) 

The correction is called the corrected value ofy„ +1 . It is apparent that we 
are using the average of the slopes at the two end points of the interval of 
integration as the average slope in the interval. 

In summary, this method has three steps: 

Step 1 Predict the value of y n + l , given the formula 

p n +Y=yn-x + 2 ^y' n (6-4i) 

Step 2 Compute the derivative at the predicted value, using the differen¬ 
tial equation that describes the system: 

Pn + l=f( X n + \’Pn+\) (6-42) 

Step 3 Make a second estimate of the value of y n+v using trapezoidal 

integration: 


y n+i =y n + i f(y'+y' +l ) (6-43) 

This process of prediction and correction has led to the naming of this 
type of integration as the predict-correct concept of numerical integration. 
A number of predict-correct algorithms are tabulated at the end of this 
chapter; they can be used for indefinite integration of differential equa¬ 
tions on the pocket calculator. 
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6-7 STARTING VALUES 

In our previous analysis we assumed that we had values for the dependent 
and independent variables at the starting or initial point. However, the 
algorithm requires not only starting values, but also earlier values. Th£, 
previous values can be obtained in two ways. They can be computed on 
the pocket calculator, or they can be analytically hand calculated. Both 
methods will be presented here. 

The hand calculation method is based on the use of the Taylor series 
expansion of the function: 

y(x + Ax)=y(x) + Axy'(*)+ ^~y"( x)+ ■ ■ • (6-44) 

The derivatives to be evaluated in the Taylor series expansion can be 
found from the differential equation by repeated differentiation. The 
number of terms of course depends on the step size and the accuracy 
desired. But, again, these are matters that can all be easily evaluated on the 
pocket calculator and the number of terms required can be empirically 
determined by continuing to take them until the desired accuracy is 
achieved. 

The method for machine calculation is based on repeated use of the 
corrector formula. Again, if we are given the initial point (x 0 ,y 0 ), we can 
estimate the earlier point (x_ 1 ,y_ ] ) by way of the “unmodified” Euler 
integration, working backwards as follow^: 

x_ { = x 0 — Ax 

y _ l =y 0 ~ Axy' 0 (first estimate of y_ { ) (6-45) 

We can use the estimate of the* previous value of y combined with the 
differential equation to evaluate the derivative at the previous value of y. 
The trapezoidal corrector formula can then be repeated to iteratively 
correct the previous estimate until it achieves the accuracy desired for the 
calculation. The system of equations for the correction process become 

/_!= f(x_ l ,y_ l ) (first estimate ofy'_ L ) 

Ax 

y_ x =y 0 — 2 ~(To+T-i) (second estimate ofy_j) (6-46) 

/-!=/(■*- l>T-i) 


(second estimate ofy'_,) 
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If, after a few iterations, the previous value of y does not stabilize, the 
integration step size can be halved, the previous value of ^ computed, 

and the process repeated to computeAnother alternative is to use the 
value ofy„_ 1/2 to estimate the value ofy„ + 1 / 2 , the process repeated to take 
a half step forward toy„ +r and then these values used as the starting 
values for the predict-correct integration algorithm. 


6-8 ERROR ESTIMATES AND MODIFYING THE PREDICT-CORRECT 
PROCESS 

The predictor formula just discussed is a midpoint integration formula that 
has the error equation 

A v 3 

= (6-47) 

The corrector formula given in Section 6-3 has the error formula 

A v 3 

< c = -~y"\0) (6-48) 

Since these error formulas are of opposite sign, the difference between the 
predicted value and the corrected value gives 

y p =Tc ? -(Texact- Cp) ~ (Texact” € c) ( 6 ‘ 49 ) 

Thus at any given step the difference between the predicted value and the 
corrected value is 


--^A x y u (ff) (6-50) 

Furthermore, we see from equation 6-49 that approximately four-fifths 
of the difference results from the predictor component and one-fifth from 
the corrector component. It is a natural extension of the predict-correct 
technique, then, to modify the integration process slightly as we proceed. 
When we predict with the equation 

+ 2Axy„ (6-51) 

we might immediately modify the value of this prediction, using the 
previous value of the predict-correct difference and the formula 


m „+\=Pn+\-l(Pn- C n) 


( 6 - 52 ) 
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Then we use the differential equation to compute the modified derivative: 

"Cm =/(*„+ !.">„+,) (6-53) 

which is then corrected by way of 

Ay 

c n +i=J'»+^-«+i+>’;) (6-54) 

leading to the final value of y n+l : 

y„+ 1 = £, „-m + i(Pn+ 1-c»+i) (6-55) 

Clearly, this procedure of predicting, modifying, correcting, and modify¬ 
ing again is about the extent to which we can go in solving differential 
equations on the pocket calculator. More advanced methods become too 
cumbersome. 


6-9 OTHER USEFUL INDEFINITE NUMERICAL INTEGRATION 
FORMULAS 

A number of commonly used predict-correct algorithms are convenient for 
pocket calculator solution of ordinary differential equations. The proce¬ 
dure, of course, is always the same. A data table for numerically evaluating 
the solution of the differential equation is prepared and then the integra¬ 
tion formulas are used directly as written. Writing them in alternate forms 
does not buy much in the way of reduced number of key strokes or of data 
entries in the actual integration process. 

The two most popular point slope formulas are the Euler predictor or 
midvalue predictor formulas: 

yn*\=y n +^ x y'n’ (e~A* 2 ) 

y„ + ,=y„-, + 2Axy', (e—Ax 3 ) 

They are usually used in conjunction with the trapezoidal corrector for¬ 
mula: 


i =y„ + ~y W+ 1a* 3 ) 


Another popular and extensively used predict-correct method is the 
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Adams method. Adams’ predictor and corrector formulas are given by 

/„ +1 = 59 >"» -! + 3 7>;' _ 2 - 9>>' _ 3 ), (e—A* 5 ) 


yr+i=y n +^( 9 y^ l + Wy'„-5y'„->+y’„-2), (*~ a * 5 ) 

These four-point formulas obviously require a substantial number of 
operations on the pocket calculator if done manually. In fact, each step 
involves at least 22 key strokes not including the derivative evaluation 
(which is problem dependent). The author has integrated a number of 
differential equations using Adams’ formulas, but they have all been 
first-order differential equations (though of a complex nonlinear nature); 
their evaluation (though time consuming) can be done conveniently be¬ 
cause of the fairly large step size that can be taken for equivalent accuracy 
with the point slope formulas. The numerical stability of these methods 
and the roundoff error associated with the alternating sign of the 
coefficients lead to difficulties, however; hence the lower-order integrators 
are recommended for manual numerical integration on the pocket calcula¬ 
tor. The programmable calculator, on the other hand, can conveniently use 
the higher-order integration formulas and take advantage of their higher- 
order accuracy. The higher-order functions are therefore discussed here. 


Runge-Kutta Methods 

The Runge-Kutta methods are based on implicitly developing increasingly 
higher orders of Taylor series expansions of a function through combina¬ 
tions of the derivatives of a function numerically evaluated on certain 
intervals of the independent variable. The Runge-Kutta methods are yet 
another variant using the Taylor series expansion method and thus are 
limited in the sense that, if the integrand is not Taylor series expandable or 
is to be evaluated across a discontinuity, the location of the discontinuity 
must be determined and the solution is computed up to the discontinuity 
and then restarted at the discontinuity. The advantage of the Runge-Kutta 
methods is that they require no starting values. 

The second-order Runge-Kutta method is given by 


y,+ i=yn+ 2(^1 +k 2 ), O—Ax 3 ) 


(6-56) 
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where, 

k i = \xf(x„,y„) 

k 2 = kxf(x n + Hx,y n + k ] ) 

The Runge-Kutta methods use Euler integration at each step. Thus, to 
evaluate equation 6-56, it is necessary to compute both k x and k 2 ■ To 
compute k 2 , the predicted value of y (y n +k x ) must be evaluated. It is 
apparent that this is equivalent to Euler’s method. Thus the procedure 
consists of first using Euler’s method to compute the first estimate of y n+l , 
which is then used along with .x: n + A;c to compute the value of the 
derivative at x(/7 + Ax) to get k 2 . Then equation 6-56 is formed, using k x 
and k 2 . 

Another form of Runge-Kutta’s second-order equation is 

y n+ \=y„ + k 2. ((-A* 3 ) 

k, = b.xf(x„,y n ), (6-57) 

* 2 = A */(*„ +4*0-. +y) 

In this form of the Runge-Kutta equation, k x is employed to make a half 
step from x n to x n -f y*% where y n is evaluated as+ ^. Then the derivative 
at this midvalue, defined by + y is computed and used to estimate the 
midvalue rate from which k 2 is calculated. Then equation 6-57 is numeri¬ 
cally evaluated using only k 2 . Again, first Euler integration must be used to 
make the first half step, and the first full step is taken by means of the 
midvalue estimates of the rate on the interval. 

Another Runge-Kutta method is also given in two forms. One is 

k x 

y n + 1 — y n y t (€~Ax ) 

k\ ~ ^xf(x n ,y n ) 

A: 2 = A^/|x+~,y„+yj 

k 3 = A xf(x n + A^,^ + 2 k 2 -k x ) 

This is the most popular and convenient form of third-order Runge-Kutta 
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integration, and is used by Hewlett-Packard in its Math Pack 1-36A 
solution to the first-order differential equation for its programmable poc¬ 
ket calculator. 

Another form of third-order Runge-Kutta is 

y n +i = y n + (e—A* 4 ) 

k x = bxf(x n ,y n ) 

k 2 = \xf{^x n + ,y„ + 

k 2 =a.«/| a„+ o'*+ -y 


Though these equations look formidable, their solution involves only 
three steps of Euler integration at the most. Again, the advantage is that 
they require no starting values. 

The two most popular forms of the Runge-Kutta fourth-order numerical 
integration are the following: 


k i At k-i kt 

y n +i = y n + -g - + y + y + y > (e~A* 5 ) 


ki = kxf(x n ,y n ) 

k 2 = bxf(x n +^-,y n +Y 


A ' 3 = Av/^A„+^,>'„+yj 
k 4 = Axf{x ri + kx,y n + k 3 ) 

and 


y n +i-y n + t + 


3k, 3k 3 
+ -~ 



( C ~Aa 5 ) 


k l =\xf(x„,y n ) 

k 2 = ^ x f[ x . + ^ + -j) 
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*3 = X n + ,y n + k 2 - y j 

k 4 = ^xf(x„ + Ax,y„ + k,- k 2 + k t ) 


In all of the methods presented in this section the differential equation is 
assumed to be of the first order and generally written in the form 
y' — f(x,y,). Since an nth-order differential equation can be written in terms 
of n first-order differential equations, these methods are applicable to 
systems of equations or to higher-order equations. 

A number of specific methods are available for higher-order differential 
equations, and in those cases special predict-correct algorithms can be 
developed. Although, for general-purpose computing, they are not very 
useful for numerical evaluation of the solution of the differential equation, 
they simplify the number of calculations on the pocket calculator. For 
example, Milne’s predictor-corrector algorithms for first-order differential 
equations take the forms 


P 

C 


p H y„+\=y„-3+ 

c i i yn-n-y„-i + ^r(yh-i +4 yl, + yt,+ i) 


(e~Ax 5 ) 


>.tl=)’,-5 + + ,4 X-3 + 1 W-<) 

9A V 

y n + 1 y n — 3 ^W + , + 23>>;+12X_ 1 +32r;_ 2 + 7^_3) 




The equivalent accuracy Milne predictor-corrector formulas, for second-, 
and third-order differential equations, are written as follows: 

P | f >n+ I = *3'n-2 + 30'„— >>„_,) +A-V 2 ^" ~y^-l), (t~Ax 5 ) 

C J i y n+ 1 =*y„ + (/„+1 +y'„) - ~-(y ” + , -y") 

A Y^ 

) I y*+l = 3 'n -2 + 3 (.y jl —y„-i) 3 2 ~(y’n' —y'n- l) 

> 1 , -y n + ^f(y'„ +l +y') - W' + , ->:) 

+^ ( y'; + i+K') 


p 

c 
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For systems of differential equations of the form 
y'=f(x,y,z), z' = g(x,y,z ) 
second-order Runge-Kutta can be written as 


/c 2 

y n +i~y n + y+ y 
4 4 

Z n+l~ Z n + y + ~2 


(e~Ax 3 ) 


k x = Axf(x n ,y n ,z n ) 
t l = Axg(x n ,y n ,z„) 
k 2 = A xf(x n + Ax,y„ 4- k x ,z n + /,) 
4 = Axg (+ Ax,^ + k ,, z„ 4- /,) 


Fourth-order Runge-Kutta for this system of equations takes the form 

k A + 2k 2 + 2 k 3 4- k 4 
-”7- 


z(u+ l) = z„ + 


/, + 2l 2 + 2l 3 + / 4 
6 


^Ax/fx^zJ 
/ i = Axg(x n ,y„,z n ) 

. . J A A ^1 4 \ 

k 2 = Aa/I A„ + y- ,y« + y ,Z„ + - I 

/ Aa ^1 4 \ 

1 2 = Axgl x„ 4- y + y + y j 

^ 3 = 4- ,y n + y ,^ + y j 

4 = Ax/|x b + y - Jn + y + y 
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k 4 = A xf{x„ + A x,y n + k 3 ,z„ + l 3 ) 
l 4 = Axf(x„ + Ax,y„ + k 3 ,z n +l 3 ) 

Another special form of second-order differential equation is 

y" =f(x,yy) 

Milne’s predictor-corrector method for these types of second-order equa¬ 
tions is as follows: 

, , . 4A* „ „ , - //N 

y „*1 + T-y „) 

y '„ + , =x;_, + , +4>>; +>>; + ,) 

The single-step self-starting Runge-Kutta method takes the form 


yn+\=y n + kxy'n+ — (fc, + £ 2 + * 3 ) 
y'n+i = y n ~ -^(k\ + 2k 2 + 2k 3 + k 4 ) 

ki = hxf(x n ,y n ,y' n ) 


(e-A.* 5 ) 


J _ \ r( , Ax , Ajt , \ 

A: 2 - + — ,y„ + + -y— + y j 


A* m ^ + &k\ ^ , k 2 

A xk 


ks-kxf\ x n + ^j-,y n + —~ + -%-y n + y 


k 4 = Ajc/^jc„ + A*,y„ + Axy' n + ,y' + k 3 j 

For second-order differential equations, 


y"=f(x,y) 

Milne’s method takes the forms 

Ax 2 

y n +1 -y n + y n -2-y n -3 + ~y (5yZ + yi- 1 + 5y;_ 2 ) 

/\ 


(<~A* 6 ) 
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The Runge-Kutta method appears as 


yn+l=yn+bx 


y' n + 


k l + 2 k 2 


k } 2 k 2 A:, . 

?-+i-?. + T + — + T' (e ~ Ax) 

k\=bxf(xj>„) 

k 2 = A xf[x n + ~ ,y„ + /„ + ^ fc,) 

k } = hxf[x n + kx,y n + kx/ n + 


In the second-order forms, the Runge-Kutta algorithms involve the 
numerical evaluation of rates by Euler integration of the second-order 
differential equation. For the programmable pocket calculator with limited 
memory, these alternate forms of numerically evaluating indefinite in¬ 
tegrals are particularly useful because they dispense with computing the 
two first-order differential equations that would be required to make up 
the second-order equation in the more general first-order indefinite in¬ 
tegration formulas. 


6-10 T-INTEGRATION 

T-Integration (tunable integration) is a new flexible integration concept 
that permits the integration formula to be tuned to the system of equations 
it is solving. In its simplest form it is written: 

yn**yn-\+^T[yy»+(\-y)yn-i\ 


This equation is based on adjusting the phasing of the integration so as 
to satisfy the mean value theorem (as opposed to numerical integration 
algorithms based on analytical substitution techniques). The parameter y 
controls the amount of transport lead (or lag) imposed on the integrand of 
the integral. For example, y = - \ means that the integrand has been time 
delayed one sample period, while y = + § implies that the integrand is time 
advanced one sample period. Since in the numerical integration of 
differential equations the solution point is not known before it is com¬ 
puted, and thus cannot be made part of the integral, it is estimated using 
an extrapolation formula. It is apparent from the equation for this integra¬ 
tor that the weight of the two coefficients in the integrand is performing 
the interpolation/extrapolation operation. Therefore, an approximate 



References 


179 


equivalent form of the T-integration equation is 

>^=--1+A n (r +1 )A> -1 - rA.- 2 ] 

In its application, T-integration is usually used in the following manner: 

1. The differential equation and the interval over which its solution is to 
be computed are defined together with its initial condition. 

2. The integration step size is set equal to one-tenth of the interval size or 
one-tenth of the shortest period in the oscillations of the solution expected 
for the differential equation, whichever is smaller. If the solution of the 
differential equation is expected to be exponential, or smooth, or of 
monotonic nature, the step size is set at one-tenth the interval over which 
the solution is to be evaluated. 

3. If the integration is an open-loop process, that is, the integrand is not 
a function of the integral, then y is set equal to \ and the differential 
equation is numerically integrated. If, however, the integrand is a function 
of the integral, y is set equal to f. When closed-loop integration is 
performed, the sequence of solutions can be plotted and the points con¬ 
nected with straight lines. 

4. The solution can then be compared with check cases that may be run 
at smaller integration step sizes or with empirically prepared check cases. It 
is generally found that the solutions generated from the use of the 
T-integrator “lead” the check case by approximately one integrating in¬ 
terval or slightly less. 

Note, however, that the dynamics of the solution prepared with the T- 
integrator match the dynamics of any check case. That is, although the 
T-integrator, which is a low-order integrator, permits accurate simulation 
of the dynamics of a discrete process, it does so at the sacrifice of a slight 
phase error. Nevertheless, in many engineering applications it is sufficient 
for determining, for example, peak overshoot, natural frequency, damping, 
resonant frequencies, and conditions of dynamic instability, which are the 
purpose of the analysis. In general, it must be remembered that all of the 
integration formulas presented here are usually not for generating of 
numbers to six places, but rather for solving problems and understanding 
the dynamics of processes for the purposes of design, test, and evaluation, 
or all of them. 
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CHAPTER 7 


LINEAR SYSTEMS 
SIMULATION 


7-1 INTRODUCTION 

The analysis of linear constant coefficient systems is important because 
they are frequently encountered in the design of continuous processes. The 
dynamic characteristics of a linear systems response to known types of 
forcing functions are usually studied when setting the parameters for a 
system design. In this chapter we discuss the synthesis of recursion formu¬ 
las by which the response of a linear dynamic process to sampled values of 
its forcing function can be conveniently computed. We tailor numerical 
integration and other discrete approximation methods for computing the 
dynamics of continuous processes to pocket calculator analysis. On a 
pocket calculator it is much easier to iterate a recursion formula to 
compute the dynamics of a process than it is to actually conduct the 
numerical integration of the process. Under some circumstances (when 
there are no hard nonlinearities, such as limits, hysteresis, and dead zones), 
it is quite easy to develop the recursion formulas from the integration 
formulas, thus eliminating many steps in the computing of the solution to 
high-order differential equations. In fact, the number of key strokes can be 
reduced 80% with recursion formulas (difference equations) as compared 
to that needed in direct numerical integration of a differential equation. 


7-2 DERIVATION OF DIFFERENCE EQUATIONS BY NUMERICAL 
INTEGRATION SUBSTITUTION 

Examples of many numerical integration formulas have already been 
discussed, such as Euler’s integration formula, rectangular integration, 
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trapezoidal integration, T-integration, and a number of predict-correct 
formulas. We used the differential equation to numerically evaluate the 
derivatives at the initial condition and then from the starting values in the 
integration formulas we predicted the solution to the differential equation 
in the neighborhood of the initial conditions. 

Another use of a numerical integration formula is to form a difference 
equation. Consider the first-order constant coefficient differential equation 

tx + x=Q (7-1) 

where 

x = x(t) 

Q = Q{t) 

T = a constant 

Now consider the Euler integration formula 

*« = *»-\ + T K-\ ( 7 ‘ 2 ) 

We can solve for the rate in the integration formula, using the differential 
equation, as follows: 


( 7 - 3 ) 

This can be substituted back into the numerical integration formula 

*« == *„-i+ j(Qn-\~ x n -i) O 4 ) 

which, when simplified, gives the difference equation 


a: 


n 



*.-■ + 70.-1 


(7-5) 


This recursion formula computes, for example, the 100th step in the 
solution of the differential equation on the basis of data generation on the 
99th step. The indices in the recursion formula keep track of the iteration 
that is being computed when solving the differential equation. They also 
indicate the approximate time at which the solution value will compare 
with x(t), that is, t = nT if the solution begins at T= 0. We shall see later 
that t^nT, but it is sufficiently close to approximately label the time in the 
sequence of solution values of the difference equation. 
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The use of recursion formulas in solving differential equations has two 
advantages. They reduce the number of key strokes needed to evaluate the 
solution of the differential equation on the pocket calculator. And in linear 
constant coefficient processes they permit the use of implicit integration 
formulas. It is these formulas in which the rates of a state variable are a 
function of the state itself. The trapezoidal integration formula is an 
example: 

T 

X n+l = x n + j( x n+l + x n) ( 7 ‘ 6 ) 

Trapezoidal integration computes the n + 1 value of x based on the n +1 
value of x. However, evaluating x in the differential equation requires 
This results in an implicit equation, whose solution is a function of 
itself. When implicit integration formulas are used to derive difference 
equations, the implicit equation can be solved algebraically. For example, 
consider the implicit Euler integration (rectangular integration), which 
takes the form 


x „ = x n-l +Tx n (1-1) 

By way of our first-order differential equation, we obtain 

*„= 7 (&,-*„) ( 7 - 8 ) 

which, when substituted back into the implicit rectangular integration 
formula, gives the difference equation 

+^(Q„~ x n) C 7 ' 9 ) 

Note that this equation is still in implicit form; that is, x n is a function of 
itself. However, it can be solved algebraically as follows: 

T T 

x n+- x „ = x »-l + -a 
T T 

(i+|)x„=x„_,+^e„ 

' ' X " = ( 1+ 7’/t) X "-' + ( 1+ T/r)®" ( 7 ' 10 ' 1 

Let us now compare the implicit and explicit Euler difference equations 
from the standpoints of numerical stability, numerical error, the manner in 
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which the differential equation seeks its final value, and their implementa¬ 
tion on the pocket calculator. 

The stability of these first-order difference equations is completely 
determined by the magnitude of the first coefficient in the difference 
equation. That is, if the term 

. 1 ... i+i 

1 + r/r r 

Implicit Integration Explicit integration 

exceeds ± 1, the difference equation becomes unstable. For example, if 
a = 2 in the difference equation^ = ay n _,, the difference equation takes on 
the solution values shown in Table 7-1. Note, however, that at a = 0.9 the 
difference equation is stable, as shown in Table 7-2. The stability criterion 
in first-order difference equations generally is that the magnitude of a be 
less than or equal to 1. Now, notice the first-order difference equation that 
is generated with the explicit Euler integration. 

Our aim here is to determine the conditions under which the integration 
step size and the system’s time constant allows a stable difference equa¬ 
tion, rather than leading to numerical instability. We, therefore, first 
determine the conditions under which the magnitude of a is less than or 
equal to 1. That is 



Solving the inequality for T/t, we see that the region of stability for the 


Table 7-1 Unstable Response of the 
Difference Equation y n — ay n _ y where a —2 


y n 


1 l 

2 2 

3 4 

4 8 

5 


16 



184 


Linear Systems Simulation 


Table 7-2 Stable Response of the Difference 
Equation y n — a Y n _, where a = 0.9 


n 

y n 

1 

1 

2 

0.9 

3 

0.81 

4 

0.729 

5 

0.6561 


difference equation derived with explicit Euler integration is 


0 < — <2 


r 


On examining the difference equation derived with rectangular integra¬ 
tion (implicit Euler integration), we see that the condition under which 


is 


1 

\ + T/t 


< 1 



Clearly, the difference equation developed with rectangular integration is 
much more stable than that generated by Euler explicit integration. This is 
a specific example of the more general result that implicit integration of 
constant coefficient linear differential equations leads to intrinsically more 
stable difference equations than do those developed with explicit integra¬ 
tion formulas. We therefore concentrate on the use of implicit integration 
formulas in developing difference equations for simulating continuous 
processes. 

Now, let us look at the accuracy of these simulating difference equations. 
Table 7-3 shows the sequence of solution values for the explicit and 
implicit difference equation’s response to a unit step. The greatest precision 
is clearly achieved with the implicit formula. These difference equations 
were tested for an integration step size divided by the time constant equal 
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Table 7-3 Comparison of Implicit and Explicit Integration-Derived 
Difference Equations when T/r = 1.5 


Normalized 

Exact 

Implicit 

Explicit 

Time 

x(nT) 

x(nT) 

Error 

x(nT) 

Error 

1=0 

T 

0 

0 

0 

0 

0 

T 

- =1.5 

T 

0.776 

0.600 

-0.176 

1.50 

+ 0.90 

T 

- =3.0 

T 

0.950 

0.840 

-0.110 

0.75 

-0.20 

1=45 

T 

0.990 

0.936 

-0.054 

1.125 

+ 0.135 


to j, which challenges the stability of the Euler-derived difference equa¬ 
tion. Both equations appear to be stable. However, the implicit difference 
equation is obviously more accurate than is the explicit equation. This is 
another special case of a general property of difference equations derived 
with implicit integration to simulate linear constant coefficient systems. 
The implicitly derived difference equations are generally more accurate 
than those derived explicitly. 

Finally, let us examine the steady state that all these difference equa¬ 
tions achieve. To do so, we must examine the nonhomogeneous equation 
(since in a homogeneous equation .all the end conditions of the steady 
states approach zero, thus making comparison impossible). For the con¬ 
tinuous and discrete equations, the step response has the forms shown 
below: 


y = Q( i-e~' /T ). + 

Exact V T J . T 

Explicit 


y n = 


1 


T/t 

I + T/t ) y "-' + [ I + T/t 
Implicit 


Qn 


In the steady state 
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Thus we can write the final value as follows: 

\imy(t) = Q y n = Q„ = Q„_ i y„ = <2„ 

Exact Explicit Implicit 

In summary: both the explicitly and implicitly derived difference equa¬ 
tions achieve the same final value for the unit step forcing functions which 
is the final value for the true continuous process. But an implicitly derived 
recursion formula is more stable and more accurate than its explicitly 
derived counterpart. 

Now let us compare the numerical integration of the differential equa¬ 
tion with that achieved by using the recursion formula. The sequence of 
key strokes required to perform the numerical integration of this first-order 
differential equation 


TX + X = Q 

using Euler’s integration formula is shown in Table 7-4. Table 7-5 shows 
the key strokes involved in the use of the difference equation. 


Table 7-4 Typical Key Stroke Sequences for Numerically 
Integrating tx + x = Q 


Reverse-Polish Algebraic 


(Qn- l) 


(Qn- l) 

RCL 1 

<— x(0) prestored 

- 

- 


RCL <— x(0) prestored 

RCL2 

<— r prestored 

X 

f 1 ! 

RCL3 

<—T prestored 

1 r ) 

X 

X 


(T) 

RCL 1 


+ 

+ 

m 


RCL 

STO 1 


0 

STO 


( )—>data entry. 


□- 


►output. 
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Table 7-5 Typical Key Stroke Sequence for Difference 
Equation Evaluation of tx + x=Q 


Reverse-Polish 

Algebraic 

(On- l) 


(Qn- l) 

RCL 1 < 

~^ — ^ prestored 

X 

’ T\ 

X 

( 

J) 

RCL2 < 

^-x(0) prestored 

+ 

RCL 3 < 

T 

i-(l — —) prestored 

RCL <-x(0) prestored 

X 

X 

1 

T) 

+ 

1-1 

\ 

Lt) 

0 



STQ2 


0 

STO 


( )-»data entry. 
□ —>output. 


And Table 7-6 summarizes the key strokes involved in the precalculation 
and iteration through the first step, the first 10 steps, and then 20 steps. 

We see that even for these simple integrators in this simple differential 
equation the reduction in key strokes using the recursion formulas (8.4% 
and 4.2%) is important enough to warrant the use of recursion formulas. A 
greater number of keystrokes is saved when recursion formulas are used to 
simulate high-order linear systems. 

These recursion formulas are particularly useful in evaluating a system’s 
response to an arbitrary forcing function. Provided that the integration 
step size is small compared with the largest period of interest in the 
oscillations of the forcing function, the recursion formulas can efficiently 
evaluate the system’s response to an arbitrary forcing function on the 
pocket calculator and in particular on the programmable calculator, where 
the implicit difference equations take up much less memory than do the 
numerical integration formulas and direct numerical integration. 

A possible difficulty associated with the implicit integration formula for 
evaluating the response of a system to an arbitrary forcing function is its 
assumption that the forcing function is known at time nT, in order to 
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Table 7-6 Number of Key Strokes (Worst-Case)*" Required to Simulate the Con¬ 
tinuous Process y + ky~f 



Numerical 

Integration 

Euler-Derived 
Recursion Formula 


Reverse-Polish 

Algebraic 

Reverse-Polish 

Algebraic 

Number of 
precalculation 
key strokes 

39 

13 

39 

13 

Number of 
key strokes 
for the first 
iteration 

22 

47 

20 

45 

Subtotal 

(61) 

(60) 

(59) 

(58) 

Number of 
key strokes 
for the tenth 
iteration 

220 

470 

200 

450 

Subtotal 

(281) 

(530) 

(259) 

(508) 

Total number of 
key strokes 
for the twentieth 

iteration 

501 

1000 

459 

958 


"Assumes 13-digit Data Entries. 


compute the response of the system at time n. If the forcing function is of 
the form 


the evaluation of f n requires 




fn=f(y»0 


but since the difference equation is still to compute y n , it is not yet in our 
table of solution values; instead we have only a tabulated value fory n _,. 
In this case we can use an extrapolation formula to estimate y n by way of 
the two past values, or we can usey„_! merely as an approximation of y n . 
This can be done when the forcing function’s components are (from a 
Fourier analysis viewpoint) of lower frequency than is the natural 
frequency of the system described by the differential equation. To achieve 
this, we calculate a few values of the difference equation, assuming in 
evaluating the forcing function that and generating the first few 
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terms of the forcing function /, and use a difference table to evaluate 
whether / is changing rapidly. If the change is rapid, we simply use an 
interpolation formula to make a first estimate of y n based on^_j and 
y n _ 2 . The author rarely finds it necessary to use the extrapolation scheme 
in the practical evaluation of the solution to differential equations. 

This technique of deriving difference equations to simulate continuous 
dynamic processes is extremely useful for simulating the dynamics of 
nonlinear processes. One problem is that most implicit difference equations 
cannot be solved for nonlinear differential equations. That is, the implicit 
equation is a nonlinear equation, and usually only iterative techniques can 
be used to solve it. However, the explicit difference equation is easily 
derived and easily put in a form that can be quickly evaluated on the 
pocket calculator, as opposed to numerically integrating the nonlinear 
equation. 


7-3 STABLE DIFFERENCE EQUATIONS 

Recursion formulas for simulating continuous dynamic processes can also 
be derived by assuming a difference equation of the same order as the 
differential equation to be simulated. Then match the roots of the 
difference equation with the roots of the differential equation and include 
an “adjustment factor” so as to match the final value of the difference 
equation with the final value of the differential equation. All that remains, 
then, is to add another “adjustment factor” to match the phasing of the 
difference equation to the phasing of the solution to the differential 
equation. For example, consider again the simple first-order constant 
coefficient continuous process 


rx + x = Q 

Assume that this equation has a solution of the homogeneous equation 


On substitution, we can derive the indicial equation as 

(rs+\)e ts = 0 

which has the characteristic root 


(7-11) 


(7-12) 


5 = — 


1 


T 


(7-13) 
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Clearly, then, the solution to the homogeneous differential equation takes 
the form 

x = c,<r" T (7-14) 

The solution to the nonhomogeneous equation can be derived with the 
convolution integral where the solution of the homogeneous equation is 
convolved with the forcing function: 

x— f Q(k)e[ (k t)/T ^dk (7-15) 

The complete solution to the differential equation then takes the form 


x = e- ,/ ^j‘Q(k) ek/r dk + c ^ (7-16) 

Similar procedures can be followed for higher-order differential equa¬ 
tions, using either time-domain analysis, Laplace transform theory, or even 
Z-transform theory. 

Let us assume that we are going to simulate this continuous process with 
a difference equation whose roots and final value match those of the 
continuous process. We assume a difference equation: 

x n = ax n _ l + bQ n (7-17) 

A solution to the homogeneous difference equation is of the form 

Xn = Cl e- T ^ (7-18) 

Upon substitution, it leads to the indicial equation for the difference 
equation: 

c l e-" r/r (l-ae T/r ) = 0 (7-19) 

Thus for the roots of the difference equation to match the roots of the 
differential equation, we require that 

a = e~ T /' (7-20) 

This determines the coefficient in the difference equation that accomp¬ 
lishes the pole matching between the difference and differential equations. 
It is clear that the solution to the homogeneous difference equation is 


x„ = e 


( 7 - 21 ) 
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This procedure has now guaranteed that the dynamics of the difference 
equation will match the dynamics of the differential equation because their 
roots are equivalent and they will generate equivalent solution values as 
seen by the exponential decay of both. What remains is to compute the 
final value of the difference equation and match it with that of the 
differential equation. The procedure here is more straightforward in that 
the nonhomogeneous difference equation takes the form 

= + bQ, (7-22) 

where the steady state of the root-matched difference equation is achieved 
when 

Qn~ Qn - 1 


Then 

By including the final value adjustment factor 

b=\-e' T / T (7-24) 

we can make the difference equation achieve the same final value as the 
differential equation. Thus the simulating difference equation takes the 
form 


x n = e- T /X-, + (i~^ T/r )0. (7-25) 

Notice that the homogeneous solution of this difference equation matches 
the homogeneous solution of the differential equation exactly. Also, it 
generates a sequence of solutions that are exact for the step response 
(0(0= £7(0) and will generate solutions that are a good approximation of 
the differential equation’s response to an arbitrary forcing function. Also 
notice that this difference equation is incapable of going unstable, regard¬ 
less of the integration step size (because the term e ~ T/r is always less than 
1 no matter how big T gets provided that T>0). 

From the tabulated values it may appear that there is significant error in 
the solutions generated with the dynamics-matched difference equation 
and that generated with the continuous differential equation. However, 
equation 7-25 makes it clear that the difference equation solutions are 
lagging the continuous solutions. The dynamics are usually identical to the 
differential equation except for this effect of phase shift. Of course, we 



192 


Linear Systems Simulation 


could reduce the step size to bring the two curves closer, but, this is not an 
efficient or correct approach to reducing this kind of error. Or we can 
compensate for this phasing error (transport delay) by determining with 
interpolation at what time the sequence of solutions generated by the 
difference equation matches the differential equation and then including 
that transport time in the tabulation of the sequence of solutions generated 
in the difference equation. Suppose that we know that for the fourth entry 
in a table of solution values the true continuous solution lies somewhere 
between t = 3T and 4T. Using inverse interpolation, we can determine the 
time at which the discrete solution matches the continuous solution and 
then arbitrarily select that time as the reference time from which we count 
nT intervals. 

It is important to remember that the solution values generated with 
difference equations and even with numerical integration formulas are 
operating at a problem time which is different from the sequence of times 
nT. That is, problem time in a discrete approximation of a continuous time 
process is different from the sequence of values nT. Hence the indices in 
the recursion formulas represent the number of iterations, not time nT. 
The analyst must determine the actual timing of the sequence of solution 
values in order to compare them with a true continuous-time check case. It 
is the author’s experience that many engineers and programmers, on large 
digital computers as well as on pocket calculators, overlook this problem of 
timing and try to compare continuous and discrete computing processes at 
times nT instead of recognizing that numerical integration is an approxi¬ 
mating process. There is a timing problem also in the synthesis of simulat¬ 
ing difference equations by dynamics matching. In fact, discrete systems 
are different in their operation on the flow of information in feedback 
loops, whether in numerical integrators or in difference equations. Thus 
the phasing of the two sequences of values between continuous and 
discrete dynamic processes must be taken into account by the analyst. The 
problem really arises only with large integration step sizes, but it is 
precisely then that efficiency is at a premium and, especially on the pocket 
calculator workload is substantially reduced from that for an integration 
step size only half as long. 

Once again we find that pocket calculator may be the analytical tool for 
teaching the difference between discrete and continuous systems dynamics 
and the simulation of one with the other. 

A few of the commonly encountered linear processes and their simulat¬ 
ing difference equations using dynamics matching methods are tabulated 
in Table 7-7. 

It is imperative that when the simulating difference equations are used 
the table of solution values be referenced to the number of iterations 
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Table 7-7 Difference Equations for Commonly Encountered Linear Constant 
Coefficient Systems 


G(s) f(t) Difference Equations for Simulation 


I 

f 

1 

•S' 

I=jf(t)dt 

s? 

+ 

i 

II 

I 

f 

1 

s 2 

/= s ff{t)dt 2 

4 = 2 /„_,-r„_ 2 + 

y _ 

1 

TS + 1 

e~‘/ T 

y,-e' T/ y„-,+0-e- T/r )x, 

y _ 

bs 


y n = e~ T/T y„_ l + be~a T (x n -x n _i) 

X 

x + a 

ar 


y _ 

w n 


y n = Ay n _ l -By n _ 2 + (\-A + B)x n 


s 2 + 2 $w n s+wZ 


A =2e~^ Wn 7 cos {w n T{\ - £ 2 ) 1/2 } 

B = e -2Sw„T 


when 0 < f < 1 


through the difference equations, not to time nT. The comparison of 
discrete solution values with a continuous check case involves timing 
considerations, and it is the analyst’s responsibility to determine the proper 
comparison in a manner similar to that mentioned above. 

The difference equations just developed by dynamics matching methods 
have some very important general properties. These difference equations 
are intrinsically stable if the process under consideration is stable. That is, 
there is no sample period T to cause these equations to become unstable if 
the continuous process that they are simulating is stable. This is because 
the roots of the differential equation are matched with the roots of the 
difference equation; hence if the continuous process is stable, the discrete 
process is stable independent of sample period. Showing that the magni¬ 
tude of the roots of the discrete system are less than or equal to 1 will 
prove this; the very way in which they are formulated shows this to be so. 
For example, in the first-order case that we just developed, when the roots 
of the discrete system are matched to the roots of the continuous system, 
the discrete system root is given by 

e~ T//r 

which will be always less than or equal to 1 for all T>0 and for r>0. The 
only condition on using the difference equation is that the forcing function 
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be sampled at a rate equal to twice the highest frequencies of interest in the 
forcing function. More detail is given in Chapter 5 where sampling rate is 
discussed. 

Also, the final value of the discrete difference equation will always 
match that of the continuous difference equation, independent of sampling 
period and without the final value adjustment factor. That this is so can be 
established by the fact that in the steady state the present and past values 
of the response of the system are the same. When substituted into the 
difference equation, the final value of the response can be computed in 
terms of the input forcing function, which is found to match the final value 
of the continuous dynamic process being simulated. 

There is a limitation in the use of these simulating difference equations. 
Clearly, a second-order continuous system can have three different 
dynamic characteristics: when the two roots of the system are real and 
equal; when they are real and unequal; and when both are complex. The 
dynamics of the second-order continuous system with complex roots is 
damped oscillatory in nature, and the response of the system with real 
roots is nonoscillatory, being damped only. Each case requires different 
types of difference equations. It is important, then, to know where the 
roots are in the complex plane to determine which difference equation is to 
be used. This is particularly true if the coefficients in the differential 
equation are changing with time and are not fixed, as in the case of linear 
constant coefficient systems. When the coefficients are time varying, these 
difference equations can be used for piecewise linear constant coefficient 
approximation, with the results matching closely the numerically in¬ 
tegrated solution of the time-varying differential equation. However, if the 
time-varying roots jump on and off the real axis, switching from one 
difference equation to another is necessary. That is, one difference equa¬ 
tion simulates the dynamics of the process when the roots are real but not 
equal; another difference equation simulates the dynamics when the roots 
are real and equal; and yet another difference equation serves when the 
roots are complex. The choice of the appropriate set of difference equa¬ 
tions is fairly straightforward, but note that the implicit difference equation 
generated in Section 7-2 does not require this changing of difference 
equations and thus might be more applicable from the standpoint of 
quickly simulating continuous processes on the pocket calculator. 


7-4 VARIANCE PROPAGATION 

Computing the propagation of noise through discrete linear constant 
coefficient processes is very easy when the noise is “almost white” and 
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stationary. The approach is to rewrite the difference equation into a finite 
memory form where the response is only a function of present and past 
values of the forcing function. This can be easily developed in the follow¬ 
ing manner. Given 

m p 

(7-20 

/-I j -1 

We wish to find 

2 c k Q„- k (7-27) 

k = \ 

The usual approach is to rewrite x n =f(x n _ l , Q) in the sequence 

x„=f(x„-,,Q) 

X„=fU( X n-2’Q)] 

x „=f[f{f(x„^,Q)}] 


Induction is then used to form the rest of the series. For example, 

■*„ = e- r/ X-. + (l-e- r/ ')e„_, (7-28) 

*.“«- rA [e- r/T *l.-2 + (l-«' r/T )fi,-J + (l-e' I ' / ')a-, (7-29) 

x„ = e-^ Xri _ 2 + (,-e-T/r } lQ^ i + e -T /Vn2] (7-30) 
We can expect the next substitution to give 

x.-e~ 3r '%_, + (l -^ r/T )[e„- 1 + ^ r/T a -2 + ^ 2r/ U- 3 ] (7-31) 

As the sequence of substitutions is continued, we have in the limit 

x„ = (l-e- T/r )(Q„- l + e- T/ 'Q„- 2 + e- 2T/T Qn-3+---) (7-32) 

If this series is truncated, we call it the finite memory form of the original 
recursion formula (infinite memory form) for x n . 

Now, to approximately compute the response of a continuous process to 
a random variable input, the mean squared value of the output is calcu¬ 
lated as a function of the mean square value of the input in the following 
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manner. Assume 




(7-33) 


Then 


HLafljQn-iQn-i 


‘ J 


x n = a l- 2 a f Q i 2 = (2a, 2 )4 


(7-34) 

(7-35) 


provided that (?(/) is stationary and “almost white.” In our example we 
can write 


\=(i-f T/ ')(a-,+^ r/T a-2+'-) 

x„ 2 = (I - e - r / T ) 2 ( e 2 -, + e - 2T/T e„ 2 - 2 + ■ " ) 


* 2 =(i - ^- t/t ) (e 2 _, +e- 2r/T e„ 2 _2 + • • •) 


„2 = (1 +e -2T/r + e -*T/r + . . . )„2 


2_(‘-^ rA ) 2 


(7-36) 

(7-37) 

(7-38) 

(7-39) 

(7-40) 


In general, then, the variance transfer function from input to output for 
almost-white and stationary noise inputs is given by 


x ^ 2 

t =2< 


(7-41) 


It is clear from these equations that if the process being simulated is 
unstable, the propagation of roundoff and truncation error is also unstable. 
If the process being simulated is stable, however, the roundoff and trunca¬ 
tion error will eventually reach the equilibrium condition set by the 
variance transfer function. All that remains is to compute the variance of 
the roundoff or truncation error, which can be done by other means 
covered extensively in others books on numerical analysis. Suffice it to say 
that if part of the task is to analyze the continuous system’s response to 
noise, these variance propagation transfer functions can be used to 
approximately predict the continuous system’s response to noise input. 



References 


197 


7-5 REFERENCES 

For this chapter refer to J. M. Smith, “Recent Developments in Numerical 
Integration,” AS ME Journal of Dynamics, Measurement, and Control , 
March 1974, pages 61-67, and M. E. Fowler, “A New Numerical Method 
for Simulation,” Simulation , Vol. 4, May 1965, pages 324-330. 



CHAPTER 8 


CHEBYSHEV AND RATIONAL 
POLYNOMIAL APPROXIMATIONS 
FOR ANALYTIC SUBSTITUTION 


8-1 INTRODUCTION 

In this chapter we are concerned not so much with the numerical evalua¬ 
tion of functions or analyzing data as with deriving polynomials that can 
be used for analytic substitution. Existing handbooks often give the series 
expansion of many advanced functions that, though useful for analytical 
work, converge too slowly to serve in numerical analysis with the pocket 
calculator. These series can be modified to converge more quickly using 
Chebyshev polynomials or rational polynomial approximations. Poly¬ 
nomial approximations for truncated series expansions of functions for 
pocket calculator analysis have the advantage that they can be written in 
nested parenthetical form and efficiently evaluated on the pocket calcula¬ 
tor with high precision. The objective here, then, is to improve the 
convergence of series approximations of given functions. 

The Chebyshev polynomials can be used in a unique process, commonly 
called economization, to transform a truncated power series expansion of a 
function into a more quickly converging polynomial. They can, therefore, 
transform tables of infinite series of questionable value for numerical 
analysis into fast converging series (of which the error is well known) by 
the Chebyshev approximation theorem. Again, these polynomials can be 
written in nested parenthetical form for pocket calculator evaluation. In 
short, Chebyshev polynomials make tables of infinite series representation 
of advanced mathematical functions (of which there are a great many) 
eminently practical for pocket calculator evaluation. Because economiza¬ 
tion is so easy to perform and makes large tables of infinite series 
198 
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immediately available, this chapter is dedicated to instructing the analyst 
on the marvelous properties of Chebyshev polynomials and their applica¬ 
tion to conditioning series for improved convergence. 

Chebyshev polynomials have five important mathematical properties: 

1. They are orthogonal polynomials, with a suitable weighting function, 
whether defined on a continuous interval or a discrete set of points. 

2. They are equal-ripple functions; that is, they alternate between 
maxima and minima of the same size. 

3. The zeros of successive Chebyshev polynomials interlace each other. 

4. All Chebyshev polynomials satisfy a three-term recurrence relation. 

5. They are easy to compute and to convert to and from a power series 
form. 

These properties together generate an approximating polynomial which 
minimizes the maximum error in its application. This is quite different 
from, for example, least squares approximation where the sum of the 
squares of the errors is minimized. In least squares approximations the 
average square error is minimized; the maximum error itself can be quite 
large. In the Chebyshev approximation, the average error can be large but 
the maximum error is minimized. Chebyshev approximations of a function 
are sometimes said to be mini-max approximations of the function. 


8-2 CHEBYSHEV POLYNOMIALS DEFINED 

The Chebyshev polynomials are simply defined by the relations 

T 0 (x) = 1 (8-1) 

T n (x) = cos(n9) (8-2) 

cos9 = x (8-3) 

Equation 8-2 shows that the Chebyshev polynomials are orthogonal (with a 
suitable weighting factor), since cosine is an orthogonal function and cos 
( nO ) is a polynomial of degree n in cos 9. 

Noting the trigonometric identity 

cos(n+ l)0 + cos (n— \)9 = 2 cos 9 cos n9 (8-4) 

we can write immediately that 

T„ + { + T„^=2xT„ 

■ '■T„ + 1 = 2xT„-T„_ 


1 


(8-5) 

( 8 - 6 ) 
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Using this recurrence relation for the Chebyshev polynomials we can easily 
generate the successive polynomials as follows: Since 


and 

in equation 8-6 we find 


T x = x 


T 2 = 2xT l -T 0 = 2x 2 -l 


Then starting with 

and 


T x 


x 


T 2 = 2x 2 -\ 

and again using the recurrence formula 8-6, we find 


T 3 = 2xT 2 -T l = 4x*-3x 

Continuing in a similar manner we can form the table of Chebyshev 
polynomials: 


T i = x 
T 2 = 2x 2 — 1 
T 3 = 4x 3 -3x 
T 4 = S>x 4 — Sx 2 + 1 
T 5 =16x 5 -20x 3 + 5x 
r 6 = 32 x 6 -48jc 4 +18v 2 -1 
T 7 = 64x 7 - 112x 5 + 56x 3 - lx 
r 8 = 128x 8 — 256x 6 + 160x 4 —32x 2 + 1 
Note that we can also form a table for powers of x in terms of 
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Chebyshev polynomials by simply solving for the powers of x from this 
table: 


1 = T 0 

x=T t 

2 T 0 + T 2 

X 

3 37-, + r 3 

- V 4 ~ 

4 3T 0 +4t 2 + r 4 

* = - 8 - 

, ior, + 5r 3 +r 5 
*-16- 

. \ot 0 +\5t 2 +6t 4 +t 6 

x =-32- 

7 35r, + 217’ 3 + 7T’ 5 + T-, 

x = - - - 

64 

8 _ 357’ 0 + 56r 2 + 287’ 4 + 87’ 6 + T a 
X = 128 

What is important about the Chebyshev polynomials is that Chebyshev 
proved that of all polynomials of degree n having a leading coefficient of 1, 
these polynomials (when divided by 2 n ~ l ) have the least extreme value in 
the interval 


— 1 < x < + 1 

No other polynomials of degree n , whose leading coefficient is 1, have a 
smaller extreme value than 


in the interval - 1 < x < +1 since max(|r /7 (x)| = |cosn0|)= 1. This is an 
extremely important finding because it says that if we approximate a 
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function in the interval |jc| < 1 with Chebyshev polynomials that are 
truncated at n terms, the maximum error in the approximation is 

1 

2 n~l 

The objective, then, is to find an expansion for function f(x) in terms of 
Chebyshev polynomials: 


/«= £ a n T n {x) (8-7) 

«=o 

The error properties associated with the Chebyshev polynomials are so 
significant that we take a few moments to show heuristically that there are 
no other polynomials with these properties. 

First note that the leading coefficient in the Chebyshev polynomials 
generated with the recurrence formulas for an rtth-order polynomial is 
2 n ~ l . Thus 


T h (x) 

2 n-l 

is a polynomial with leading coefficient 1. Also, since T n {x) is a cosine 
function (x = cos0), in the interval 

0 < 9 < 7T 

there are n + 1 maxima alternating from + 1 to - 1. Clearly our normalized 
Chebyshev polynomial also has n + 1 extreme values on the interval. 

To prove that there is no other «th-order polynomial with leading 
coefficient 1 which has smaller extreme values in the interval, we assume 
that there is such a polynomial and prove that it must be a Chebyshev 
polynomial. 

Assume that there is a polynomial c(x) of degree n with leading 
coefficient 1 which has a smaller extreme value in the interval than the 
extremes of our normalized Chebyshev polynomial. Then 

T J x ) ,. ,,. 

-^ rr -c(x) = J(x) 

is a polynomial that has n +1 maxima alternating in sign n times in the 
interval |jc|< 1; thus J(x) has n roots. But the polynomial formed by the 
difference between the normalized Chebyshev polynomial and the poly- 
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nomial c(*) is of degree (n— 1). Thus a polynomial of degree (n— 1) can 
have n zeros only if the polynomial is zero. If 

TAx') 

J(x) = 0=— y~c{x) 

then 


<•(*) 


TM) 

2«-1 


Because of the power of the Chebyshev polynomial approximation, its 
orthogonality properties are worth examining to make sure that the Che¬ 
byshev polynomials are orthogonal and that a formula can be developed 
for deriving the coefficients in the series expansion of equation 8-7. We can 
determine the orthogonality properties or characteristics for the Chebyshev 
polynomials from what we know of the orthogonality of the cosine 
functions: that is, 

( 0 (m^n) 

cos(m0)cos(n0)d0 )n /2 = 0 ) 

( 7 T (m = /7 = 0) 

Then, substituting 



T n (x) = cos(n0) 
cos 0 = x 


to obtain the orthogonality properties of the Chebyshev polynomials, we 
find 


/ 


l 


l 


0-* 2 )‘ /2 


dx = 



(m^n) 
(m = nj^O) 
(m = n =0) 


( 8 - 8 ) 


We see, then, that the Chebyshev polynomials form an orthogonal set on 
the interval 


- 1+1 


with the weighting function 


w(x) = 


(i -* 2 ) ,/2 
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The Chebyshev polynomials can also be shown to be orthogonal over a 
discrete set of x n . These orthogonality conditions can then be used to 
evaluate the coefficients a n in equation 8-7: 


a ^L 


-1 

+1 




f(x)T„(x) 

0-* 2 )' /2 

fix) 

( l -* 2 )' /2 


dx, (n > 1) 


dx , (« = 0) 


(B-9) 


( 8 - 10 ) 


Similarly, if the function f(x) is only defined on 


• x jP = C0S ^ r (/ ? + i) 

then the coefficients in the expansion 


/(*)- sV„(*) 

n =0 

are given by 


4 2 f(x p )T n (x p ) 

^ /> = o 

i 7V-1 

2 /(*„) 

for the discrete (which is easier to use) or the continuous method for 
preparing a Chebyshev approximation of a function /(x). 

There is another approach to approximating f(x) with Chebyshev 
polynomials that is due primarily to Lanczos. It is powerful and simple to 
use, having the nice property that it will usually improve the convergence 
of any truncated series expansion of a function f(x). We do the following: 

1. Write a truncated series or polynomial approximation of f(x) in 
nested form. 

2. Rewrite the polynomial in terms of Chebyshev polynomials. 

3. Truncate the Chebyshev approximation by an additional one or two 
terms. 

4. Rewrite the Chebyshev polynomials in terms of polynomials in x. 

5. Rewrite this polynomial in nested parenthetical form for convenient 
numerical evaluation on the pocket calculator. 
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For example, if we write a truncated power series representation of a 
function in the form 


/(■*)= 2 ( 8 - 11 ) 

« = 0 

by rewriting equation 8-11 in the nested parenthetical form 

f(x) = a 0 + x(a l + x(a 2 + • • ' +x(a m _ l + a m x)- • • )) 

we can convert this to a series of Chebyshev polynomials by starting at the 
inner parentheses and rewriting it in the form 

a „-i + a », x = a m-\ T o + a m T i ( 8 - 12 ) 

Assuming that at the /ith parenthetical nest 

a 0 T 0 +0i l T l+ • * ‘ +a n T n ( 8 ‘ 13 ) 

we can multiply this by x and add to it the next coefficient in the power 
series a M _ n _ l to get the (/i + l)st nest. Then, by using the relationships 

xT 0 =T, 

X T„ = ZkiiTIkzl ( 8 - 14 ) 

the power series in the «th parentheses is transformed in the («+l)st 
power series in Chebyshev polynomials as 


x n T n t a n - { T n _ l ^ 


+ 


«! + «3 


r,+ r,+ 




(8-15) 

The process for generating the coefficients at any given stage in the 
development of the Chebyshev polynomial expansion of F(x ) can be 
visualized as shown in Figure 8-11. Here the coefficients at a given stage 
are used to generate the coefficients in the next stage according to the 
diagram. 

In this approach, the coefficient associated with the Mth term of the 
original series 


a m x 


m 
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Coefficients at 
nth nested 
parentheses 


Coefficients at 
(n + l)st nested 
parentheses 



Figure 8-1 Process for generating coefficients for the Chebyshev expansion of/(x). 


becomes, in the Chebyshev polynomial expansion, 

^7 T m (x) 

Thus if we truncated the Chebyshev polynomial expansion of f(x) begin¬ 
ning with the mth term, the error would be on the order of [a m /(2 m ~ l )\ 
instead of a m as in the original polynomial approximation (see Table 8-1). 

N 

Table 8-1 Expansions of f N (x) — ]>] a„x n in Chebyshev Polynomials 

n=0 

Expansion in powers of x 

fo~ a o 
f\ = «o+ a i x 
f 2 = a 0 +a l x + a 2 x 2 

f 3 = a 0 +a l x + a 2 x 2 + a 3 x 3 

f 4 =a 0 +a l x + a 2 x 2 + a 3 x 3 + a 4 x 4 


/ 5 = a 0 + a Y x + a 2 x 2 + a 3 x 3 + a 4 x 4 
+ a 5 JC 5 


Chebyshev expansion 

a oTo 

a 0 To+a x T\ 

(«o+y )r 0 +a,r, + (^)r 2 




a 2 a 4 

a ° + Y 


T 0 +( «i + \Ti + ^(a 2 + a 4 )T 2 -\- 


l a 2 a 4 \ 3 a 3 10a 5 \ 

( a ° + T - T') r » + ( a > + T + T6 ") T ' + 


l/a 2 + a 4 
2 1 2 


T, + \ -f + 


t,+( 
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In this sense, then, we say that the Chebyshev expansion converges more 
quickly than does the original expansion. 

Another way of looking at it is as follows. If the original polynomial 
approximation of the function was accurate to some error, the Chebyshev 
polynomial approximation will usually be almost as accurate to the same 
error with fewer terms. Additional terms can therefore be dropped from 
the reexpressed polynomial. This is the process called economization. 

The procedure followed in equations 8-12 through 8-15 shows the 
approach to generating Chebyshev polynomial approximations of /(x) in 
general for n terms. However, polynomials of a degree higher than 5 are 
generally cumbersome for pocket calculator analysis. The table of powers 
of x in terms of the Chebyshev polynomials, presented earlier in the 
chapter and partly repeated here for convenience, is useful for direct 
substitution of powers of x for its Chebyshev polynomial equivalent. 


1 = 7; 


jc = r, 


2 T 0 + r 2 

*- T~ 


3 r, + r 3 


4 3ro+4r 2 +r 4 

X = --- 


10r i + 5T 3 + t 5 
16 


In this way a power series such as 

/(x) = a 0 + a x x + a 2 x 2 + • • • + a M x M 

can be converted into an expansion in Chebyshev polynomials: 

/(x) = b 0 + b x T x + b 2 T 2 + + b M T M 

For this process to be workable the series must be written in a form 
where the evaluation of/(x) takes place for x on the interval ( — 1 <x< + 
1). Once the expansion is written in terms of Chebyshev polynomials, they 
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can be replaced by polynomials in x from our earlier table, repeated here 
for convenience: 


T o = 1 


T, = x 
T 2 = 2x 2 -\ 

T 3 = 4x 3 -3x 
T 4 = 8 x 4 — 8 x 2 + 1 
T 5 = 16x 5 — 20x 3 + 5x 

They can then be algebraically simplified and written in nested parentheti¬ 
cal form for quick evaluation on the pocket calculator. 

Hamming works the easy-to-follow example 

>> = ln(l + *)-*-y + y* (8-16) 


to illustrate the method of economization. By direct substitution for 
powers of x we can rewrite this power series expansion in terms of the 
Chebyshev polynomials as 


y**T x - 




T n 


y= - 


+ 


3 \ T 2 T 

1 + T2> t '~T + 


_3 

12 


T o^ 

y^~T + 


15 r, t 2 t % 
12 _ T + "iI 


(8-17) 


Dropping the last term in the power series (equation 8-16) results in 
dropping 0.25 from the numerical evaluation of y (when x=\); a roughly 
equivalent error is produced in the power series (equation 8-17) when 
dropping the last two terms. This can be seen by noting that at most the 
error will be 


«=T2 — i = -0.1766- - * 

* Hamming carries five terms—more than we need to illustrate the principle for pocket 
calculator polynomials of convenient size. 
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We know this because the magnitude of the value of the Chebyshev 
polynomials is less than or equal to 1 for all * on the interval. Thus we can 
write 


7n 157, T 2 

y = ln(l+Jc)a--^- + -|2- J (8-18) 

with somewhat better accuracy than is given by equation 8-16. This, then, 
is the process of economization. Using the definitions of the Chebyshev 
polynomials we can rewrite equation 8-18 in the form 

>■ = ln(l+*)=— +-^-^ + ^ = *(1.25-0.5.*) 

The numerical comparison of the economized second-order and the non- 
economized third-order polynomials is given below. The economized 
quadratic equation has smaller average error (-0.0676) than the non- 
economized cubic equation (0.1993); it also has the smallest maximum 
error (0.1 at * = 0.5) on the interval 

0 < * < 1 


Exact Economized Noneconomized 


X 

j = ln(l + x) 

ysjc( 1.25 

-0.5*) 

^x-0.5* 2 

+ 0.333* ; 

y 

Error 

y 

Error 

0.1 

0.09531018 

0.12000000 

-0.0246 

0.0948333 

0.0052 

0.3 

0.26236426 

0.33000000 

-0.0676 

0.25050000 

0.0495 

0.6 

0.47000363 

0.57000000 

-0.0999 

0.38400000 

0.2160 

0.9 

0.64185389 

0.72000000 

-0.0781 

0.37350000 

0.5262 


Let us recap Chebyshev polynomials in the context of pocket calculator 
analysis. We saw in Chapter 1 that the evaluation of polynomials greater 
than third or fourth-order involved a sizable number of key strokes on the 
pocket calculator. We found that by writing these polynomials in nested 
parenthetical form we could go to fifth and sixth-order polynomials with 
the same number of key strokes as required for the third-order polynomial 
evaluation. This gave us additional accuracy with the same number of key 
strokes. Now we have found that the economization process due to 
Chebyshev polynomials can occasionally provide high-order polynomial 
accuracy with low-order polynomials, even further reducing the number of 
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key strokes for, say, equivalent fifth-order polynomial evaluation accuracy. 
That is, retaining fifth-order polynomial accuracy with Chebyshev 
polynomials can provide equivalent accuracy of seventh- or eighth-order 
noneconomized polynomial expansions. Then when it is written in nested 
parenthetical form, the Chebyshev polynomial provides this accuracy with 
many fewer key strokes than would normally be required for up to eighth - 
or ninth-order polynomial expansions. Thus the nested parenthetical 
evaluation of Chebyshev polynomial approximations reduces the workload 
on the pocket calculator from that associated with a ninth-order poly¬ 
nomial approximation of f(x) to that of a second- or third-order poly¬ 
nomial approximation of f(x). This results in an order-of-magnitude 
reduction in key strokes. 

In general, the approach to evaluating advanced mathematical functions 
is (1) to write the function in a truncated polynomial form, (2) rewrite that 
expression so that the interval on which f{x) is to be evaluated is between 
— 1 < x < +1, (3) economize the series using Chebyshev polynomials, (4) 
rewrite the Chebyshev approximation to the function in nested parentheti¬ 
cal form, and (5) use it for numerical evaluation on the pocket calculator. 


Numerical Evaluation of Chebyshev Polynomials 

It is useful to know that the recurrence formula (recopied here for 
convenience) for the Chebyshev polynomial can be used to numerically 
evaluate Chebyshev polynomials: 

T„(x)-2xT„_ t (x)-T n _ 2 (x) 

The starting values for the recurrence formula can be computed with: 

7o=l 

T t -x 

Thus the Chebyshev polynomial expansion of a function, once written, 
need not necessarily be given in powers of * but can be numerically 
evaluated directly. For example, the equation 

>>=ln(l + x)a-^ + i|r 1 -^ + ^ (8-17) 

is the Chebyshev approximation to In (1 -I- x), which was developed earlier. 
Using the recursion formula and the fact that T 0 = 1 and T { = x, we can 
now numerically evaluate equation 8-17 by first evaluating the numerical 
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values for the five Chebyshev polynomials; for example, when x = 0.3, 

T x = 0.3 

T 2 = (2)(0.3) T x - T 0 = 2 X 0.3 X 0.3 - 1 = - 0.82 


T 3 = (2)(0.3) T 2 - T x = - 2 X 0.3 X 0.82 - 0.3 = - 0.792 

These can be substituted in the power series expansion to numerically 
evaluate the series: 


y = ln( 1.3) a - i + || x 0.3 - | ( - 0.82) + -L (- 0.792) = 0.2640 
y = ia( 1.3)=0.26236426 

This procedure allows convenient numerical evaluation of high-order 
Chebyshev polynomials (e.g., 20). Although writing the nested parentheti¬ 
cal form of the Chebyshev polynomial expansion of a function is possible, 
it is cumbersome and can trip-up the user if he forgets which parentheses 
he is at in the numerical evaluation process. A better alternative is to 
compute first the numerical values of the Chebyshev polynomials and then 
substitute them into the polynomial expansion equation, since this does not 
directly involve the evaluation of high-order polynomials. 


8-3 APPROXIMATION FOR ANALYTIC SUBSTITUTION WITH 
RATIONAL POLYNOMIALS 

We have seen so far that the expansion of a function in terms of 
Chebyshev polynomials can be used to series expand a function that 
minimizes the maximum error in the approximation on the interval — 1 < x 
< +1. As noted in Chapter 3, the economized Chebyshev approximations 
were not extensively employed in precision evaluation of functions. The 
reason is that they are not necessarily the best approximation for pocket 
calculators—from the standpoint of the time required to evaluate the 
function and the storage needed to store the coefficients. A better 
approach is to use the ratio of two polynomials as a means for approximat¬ 
ing functions. Again, this is so because nested polynomials are con¬ 
veniently numerically evaluated on any digital computer including the 
pocket calculator. 

Consider the case where we wish to represent a function as the quotient 
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of two polynomials: 


/M 


a 0 + a x x + a 2 x 2 + • • • + a n x n 
1 + b x x 4- b 2 x 2 •+■••* + b m x m 


R n ( x ), 


(N= n + m) 


The rational polynomial approximations used here are those whose nu¬ 
merator is equal to or greater by 1 than the degree of the denominator. The 
constant term in the denominator can be taken to be 1 without loss of 
generality. Since our concern here is the interval - 1 < x< + 1, we can use 
the Maclaurin series expansion for f(x) as a means of determining the 
coefficients in the rational polynomial approximation. The number of 
terms that we would use in the Maclaurin series is equal to the sum of the 
order in the numerator and denominator, because this is the number of 
coefficients that must be determined. If we write 

f(x) = tfo + c iX + c 2 x 2 + ■ • • 


for the Maclaurin series expansion of /(x), the difference between the 
Maclaurin series and the rational polynomial approximation can be for¬ 
med as follows: 


f(x)~ R n {x) = (c 0 +c [ x + c 2 x 2 + • • • Tc^x") 


a 0 +a x x + • • • + a n x n 
1 + b x x + • * • + b m x m 


f(x)-R N (x) = 
Now, if 
then 


(c 0 + c,x + c 2 x 2 + • • - + c A ,x' v )(l + b x x +•••-!- b m x m ) 

— (a Q + a x x+ • • • +a n x m ) 


1 + b x x + * • * + b m x* 


(8-19) 


f(x) = R n (x) at x = 0 


c 0 -a 0 = ° 

Similarly, for the first N derivatives of /(x) and ^(x) to be equal at 
x = 0 the coefficients of the powers of x in the numerator must all be zero. 
This gives the system of equations shown below: 

b x c 0 +c j — a x =0 
b 2 c 0 + b x c x + c 2 — a 2 = 0 
b 3 c i3 + b 2 c x + b x c 2 + c 3 — a 3 = 0 
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- \^n — m+\ + 

+ 

aP 

1 

aP 

II 

O 

^n —m + l ^n—\^n —m + 2~^~ 

'' +c «+i = 0 

bnt C n-m + 2+ bm-l C n-m + 3^~ 

' • +C n + 2 = ° 

^n — m "P — 1 Cj — m + 1 "P 

•• +c n = 0 


When combined with equation 8-19, this can be solved for all coefficients 
of the rational polynomial. 

The process just described is that of forming the Pade approximation. 
We illustrate Pade approximations with a simple example. 

Example. Consider approximating sin(x) with a rational polynomial 
where N~ 9. We use a fifth-degree polynomial in the numerator. The 
Maclaurin series expansion through x 9 for arctan (x) is 

Y 3 y 5 X 1 X 9 

arctan(x) = x- — + —-— + ^ = M 9 (x) 


Following the procedure outlined above, we form the equation 


(1 - jx 3 + jx 5 — ^x 7 + ^x 9 )(l + b x x -f- • • • 4- b 4 x 4 ) 


f(x)~R 9 (x) = 


~(a 0 + ■ ■ ■ +a 5 x 5 ) 


1 + b x x -I- b 2 x 2 + • • • T b 4 x 4 


from which we can evaluate the coefficients for the rational polynomial: 


O 

II 

o 

Q 


bl~ 3^3 


a x =\ 




a 2 =b\ 


u> 

II 

sj|0* 


a 3= - 

3+^2 


+ 7^2 

«4= “ 

iV+^3 




a 5 = y — ib 2 + b 4 
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Solving for the a’s and b' s, we find 

a 0 =0 

«i=> 

a 2 = 0 

a 3 =i 
a 4 = 0 

a 5 = m 

b { = 0 

b 2~~9 

b 3 = 0 

£>4= Jl 


Thus the rational polynomial for approximating arctan (x) is given by 


arctan(x) 


x H- ^x 3 + ^x 5 

T+ fx 2 +^-x 4 


x(l + ^x 2 (l + ^fex 2 )) 
l + ^x 2 (l + ^x 2 ) 


The table below compares the rational polynomial approximation (Pade 
approximation) and the Maclaurin series expansion for arctan (x). 


X 

Arctan (x) 

R 9 (x) 

Error 

M 9 (x) 

Error 

0.2 

0.19740 

0.19740 

0.00000 

0.19740 

0.00000 

0.6 

0.54042 

0.54042 

0.00000 

0.54067 

- 0.00025 

1.0 

0.78540 

0.78558 

-0.00018 

0.83492 

-0.04952 


Errors in Pade approximations can be estimated by computing the next 
nonzero term in the numerator of the rational polynomial. The procedure, 
though somewhat tedious, gives an error formula in terms of the next term 
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with a nonzero coefficient. Furthermore, the error formula can be 
evaluated over the interval in which the function is being evaluated. The 
alternative is to work out a simple error curve as shown in our examples 
here; this can be conveniently done on the pocket calculator. 

The rational polynomial approximation is significantly more accurate 
than the Maclaurin approximation. When x—1, the Maclaurin series 
ceases to converge while the Pad6 approximation is still fairly accurate. 

In the pocket calculator evaluation of the rational polynomial approxi¬ 
mation we write the numerator and denominator in nested parenthetical 
form, numerically evaluate each separate polynomial, and then perform 
the division numerically. 

Similarly we could have started with the Chebyshev series for arctan (x) 
on the interval +1 to - 1, formed a rational polynomial in Chebyshev 
polynomials, and then proceeded to evaluate the coefficients in the 
approximation. The procedure differs from that above in that the product 
of the two Chebyshev polynomials results in squares and products of the 
polynomials themselves. In turns out, however, that the product of two 
Chebyshev polynomials can be given by 


T n (x)T m {x)=i (T„ +m (x)+T„_ m (x)) 


here we have again a useful form for evaluating the coefficients in the 
numerator of the rational Chebyshev polynomial approximation formula. 
Once the approximation in Chebyshev polynomials is determined, it can be 
reduced to approximations in the independent variable on the interval + 1 
to— 1. 

As a simple example, consider the Chebyshev polynomial expansion of 
e x given by (see Example 8-2 on page 220) 

e x = 1.2661 T 0 + 1.13037^ + 0.2715 T 2 -(-0.0444 T 3 =/(x) 


Using the principles of rational polynomial approximation, we find the 
approximation R 2 in the form 


*3 = 


^ 0 + 01^1 + a 1^2 
1 +b ] T l 


Next we form the function f(x)-R y Setting the powers of x in the 
numerator equal to zero, we then find 

(1.2661 + 1.1303 T x + 0.0444 T 3 )(l + b l T)~(a 0 + a x T { + a 2 T 2 ) 
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of which the numerator becomes 

1 .266 1 + 1 . 1 306 7 , + 0.27 1 5 T 2 + 0.0444 T 3 + 1 .266 1 b , T x + 1 . 1 303 b , T? 

+ 0.21\5b ] T l T 2 + 0.0444b l T l T 3 -a o -a l T l -a 2 T 2 


Remembering that 


T r,( X ) T m( x ) = \[ T n + m( x ) + ^-mW] 

we can write (equating/(x) to R 3 and equating their first three derivatives, 
respectively): 


a 0 = 1.2661 + 
a, = 1.303 + | 
^ 2 = 0.2715 + 
0 = 0.0444 + 


1.1303 

2 

0.2715 


1.1303 , 0.0444 

r 


b , 

+ 1.2661)6, 


0.2715 , 


Thus 


with the result that 


6,=-0.3266 
a 0 = 1.0815 
a t = 0.6724 
a 2 = 0.07966 


1.0815 + 0.6724 7,+ 0.07966 T 2 
e ~ 1-0.3266 T, 

Then substituting 

T,-x 
r 2 = 2x 2 -i 

we find 

I.0018 + 0.6724jt + 0.1593x 2 
e ' 1 — 0.3266a: 
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A comparison of the two types of approximations is shown in Figure 
8-2. It is apparent that the approximation found by 


e = 


^ 3 + ^3 
2 


+ 


e R + € C ' 
2 


(where C 3 is the Chebyshev approximation of f(x) developed in example 8-2 
on page 220) has the error curve shown as a dashed line in the figure and is 
somewhat more accurate than either approximation alone. 



Figure 8-2 Approximations of e x . (1) Chebyshev alone, (2) rational Chebyshev polynomials, 
and (3) a combination of both. 


Example 8-1. Prepare the rational polynomial approximation sin(x) 
= R 5 (x ) in the neighborhood of x = 0. Then 

a Q + a x x + a 2 x 2 + a 3 x 3 
1 + b x x + b 2 x 2 


R s(x) = 
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.X 3 Jt 5 

sin(*)x - y + - c 0 + c x x + c 2 x 2 + c 3 x 3 + c 4 x 4 + c 5 x 5 

From R 5 and sin(jc) we see that 


c 0 — 0 

«o = ^o 

b o = 

1 

Cj=l 

* 1**1 

*1- 

*1 

c 2 = 0 

a 2 = a 2 

6 2 = 

*2 

c 3 =~i 

*3 = ^3 

6 3 = 

0 

c 4 = 0 

a 

-u 

II 

O 

*4 = 

0 

c 5 = lio 

a 5 = 0 

6 S = 

0 


We know (see text) that the six equations for determining the six 
coefficients ciq , ci^, ci 2 , < 2 3 , b^, and b 2 are 

c o“ a o ==0 

c 0 b x + c x - a = 0 

c 0 b 2 c x b x “f" c 2 a 2 = 0 

c 0 b 3 + c Y b 2 + c 2 by + c 3 - a 3 = 0 


c 0 Z> 4 + c,6 3 + c 2 b 2 + c 3 b x + c 4 = 0 


c 0 b 5 + c ] b 4 + c 2 b 3 + c 3 b 2 + c 4 b { + c 5 = 0 

On substituting directly from the preceding table of a% b% and c’s, we 
find 

a 0 = 0 

a i = 1 

a 2 =b { 

c h ~ ~ z + b 2 

b , 

o=-^.-a=o 

0 = no _ • ^2 = i 
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Substituting for b 2 in a 3 and b l in a 2 , we find 
a o = 0 

a \ ~ 1 
a 2 = 0 

a 3 = (lQ- i)-(l§0 — T%) = ~ T& 
b x = 0 

^ 2 = ^ 

Thus 


sin(x) 




3 


1 + A * 2 


We can tabulate values of this approximation and compare them with 
the Maclaurin series approximation of sin(x) and the actual value of 
sin(x). This is done in Table 8-2. 


Table 8-2 


X 

sin( x) 

*5 

Error 

m 5 

Error 

0.25 

0.24740396 

0.24740395 

1 X 10~ 8 

0.24740397 

- 1 X 10" 8 

0.50 

0.47942554 

0.47942387 

1.67X 10“ 6 

0.47942708 

— 15.4 X 10" 6 

0.75 

0.68163876 

0.68161094 

27.82X10" 6 

0.68166504 

-26.28 X 10" 6 

1.00 

0.84147098 

0.84126984 

201.14x 10~ 6 

0.84166667 

- 195.68 X 10" 6 

1.25 

0.74898462 

0.94806763 

916.99X 10~ 6 

0.94991048 

— 925.86 X 10" 6 

1.50 

0.99749499 

0.99438202 

3112.96 X 10~ 6 

1.00078125 

- 3286.26 X 10" 6 

7t/2 

1.00000000 

0.99577290 

4227.10 X 10~ 6 

1.00452486 

— 4524.86X 10" 6 

2.00 

0.90929743 

0.88888888 

2.040854X10" 2 

0.93333333 

-2.403591 X10" 2 


In the interval between 0 and 1 the Maclaurin expansion is more accurate 
(as we might expect), while the rational polynomial tends to be somewhat 
more accurate outside the (0,1) interval. 

This example makes the important point that rational polynomial ex¬ 
pansions, though involving polynomials of lower powers than do the 
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polynomial from which they were derived, are not necessarily equally 
accurate over all intervals. The procedure of generating rational poly¬ 
nomial approximations of functions is, however, also illustrated in the 
example. 


Example 8-2. Economize the Maclaurin series expansion of e x \ 


x 1 x 3 

> x =\ + x+~ + — 




120 720 


Since 


x=T t 


x 2 =l(T 0 +T 2 ) 

* 3 =!(3 r, + r 3 ) 

* 4 =i(3T 0 + 4T 2 +T 4 ) 
x 5 =-k(\0T l + 5T 3 + r 5 ) 
x 6 =i(107’ o + \5T 2 + 6T 4 +T 6 ) 

we can rewrite e x as 


= T 0 + T, + i( T 0 + T 2 ) + i(3 T, + T } ) + tJj(3 T 0 +4 T 2 + T 4 ) 

+ T920 0 0 ^1 +5 7 3 + • • * ) + 23040 (10 Tg + 15 T 2 +•••)+••• 
e x = 1 .2661 T 0 + 1.13037, + 0.27 1 5 T 2 + 0.0444 7 3 + • • • 
e x =\ .2661 + 1.1303x + 0.2715(2 x 2 - 1) + 0.0444(4x 3 - 3 x) + ■ • • 

£ * = 0.9946 + 0.9971 x + 0.5430.x 2 + 0.1776x 3 + • • • 

Note that the terms involving T 0 , T v T 2 and T 3 were carried from 
substitutions of polynomials for up to six terms in the Maclaurin series. 
Thus we also continue the effect of the sixth term on the first and 
succeeding terms—the effect that makes the economization work. 
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Comparison of the Chebyshev and Maclaurin approximations is shown 
in Table 8-3. 


Table 8-3 


X 


Maclaurin 

Chebyshev 

Error 

M 

Error 

C 

1.0 

2.7183 

2.6667 

2.7123 

0.0516 

0.0060 

0.8 

2.2255 

2.2053 

2.2307 

0.0202 

-0.0052 

0.6 

1.8221 

1.8160 

1.8267 

0.0061 

-0.0057 

0.4 

1.4918 

1.4907 

1.4917 

0.0011 

0.0001 

0.2 

1.2214 

1.2213 

1.2172 

0.0001 

0.0042 

0.0 

1.0000 

1.0000 

0.9946 

0.0000 

0.0054 


Notice that the Chebyshev error is a maximum at x = 0 and the Maclaurin 
error is a minimum. This is because of the osculating nature of Maclaurin 
approximation at the origin as compared with the mini-max nature of the 
Chebyshev approximation on the interval (0,1). This is illustrated in Figure 
8-3. 



Figure 8-3 Error in Chebyshev economization of a Maclaurin series expansion of e x . 
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Example 8-3. Approximate sin(x) with Chebyshev polynomials by the 
method of economization. 

In this simple example, we use the Maclaurin approximation of sin(x), 
again because it is an approximation centered on the interval — 1 < x < + 
1. We see that 


Then 




sinx=Tj 24 T\ T T^} T 1920 + 5T 3 + T 5 ) 

sinx^^ff T l - 128^3 + \mT 5 

The higher powers of x from the Maclaurin series would make further 
contributions to the 7\, T v and T 5 coefficients. The contributions are 
small, however, especially for the early T x terms. The x 5 term, in particu- 



Figure 8-4 A comparison between the error that results from a Maclaurin approximation 
and an economized Chebyshev approximation of sin(x). 
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lar, changes the T x coefficient by less than 1% and the x 7 term alters it by 
less than 0.01%. 

Economizing the Chebyshev approximation (dropping the T 5 term), we 
find 


On substituting 

T x = x 

T 3 = 4x 2 -3x 

we find 


sin(x)» 0.9974;c - 0.1562.x 3 
sin(*) = x (0.9974 — 0.1562.x 2 ) 

The errors for the Maclaurin approximation of sin(x) and the 
economized Chebyshev approximation are compared in Figure 8-4. 


8-4 REFERENCES 

For this chapter consult Richard Hamming’s Numerical Methods for Scien¬ 
tists and Engineers (McGraw-Hill, 1973), Chapters 28, 29, and 30. For 
further reading on some of the examples given in this chapter read Curtis 
F. Gerald’s Applied Numerical Analysis (Addison-Wesley, 1970). The 
author has used Gerald’s textbook in courses on numerical analysis and 
has found it to be detailed and easy to read. 



CHAPTER 9 


DETERMINING THE ROOTS 
OF A FUNCTION 


9-1 INTRODUCTION 

In this chapter we examine in more detail the problem of finding the roots 
of a function, which we encountered briefly in Chapter 3. First we discuss 
the real zeros of continuous functions and then touch on the problem of 
complex zeros. The need to find roots of a continuous function arises 
frequently in engineering, usually when solving implicit equations, deter¬ 
mining maxima and minima, or finding solutions of simultaneous equa¬ 
tions. The methods particularly suited to pocket calculator analysis are 
considered here. They differ from the standard methods for evaluating 
roots on large-scale digital computers in that the analyst must understand 
the function whose roots he is trying to find so as to select the proper 
approach to the problem. 

Three methods for finding the real roots of a function are discussed. 
Perhaps the most straightforward approach is that of bisecting the interval 
over which the root is expected to be identified. The bisection method is 
slowly converging, but is virtually foolproof in its application (i.e., it is 
almost impossible to misuse the method). 

Another approach is a modified form of the “false-position” method 
developed by Hamming. It is a fast-converging method, but involves 
slightly more functional evaluations than does the bisection method. 
Furthermore, for certain functions the false-position method can converge 
more slowly than does the bisection method, but these functions are not 
frequently encountered in practical engineering analysis. 

Finally, there is Newton’s method and its application to finding powers 
or «th roots of a number N (used in Chapter 2). Though it can be used 
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effectively for other numerical evaluations, Newton’s method does involve 
evaluating a derivation and can be a slow-converging process. Once it 
begins to close in on a root, however, it does so essentially doubling the 
number of decimal places to which we know the value of the root at each 
step. In this sense, it is a fast-converging method, once it gets close to the 
root. 

In evaluating the complex roots of a function, we restrict ourselves to a 
method whose search pattern determines the roots only crudely. Their 
accurate evaluation is possible, but the necessary procedures and tracking 
methods are too complex for pocket calculator analysis. The approxima¬ 
tion is usually sufficiently accurate for engineering analysis however. 

The chapter closes with a discussion of the zeros of nth-order po¬ 
lynomials (we covered only first-, second-, third-, and fourth-order 
polynomials in Chapter 2). The search for the zeros of a polynomial is 
treated as a special case in this chapter because much is known about the 
roots of polynomials. Specifically, we know the following: 

1. Polynomials of nth order have exactly n roots; thus we know precisely 
when all roots have been found. 

2. When a zero is found, we can divide it out of the original polynomial 
to obtain a lower-degree polynomial for an easier evaluation of the 
remaining roots. 

3. Most polynomials can be scaled to facilitate root evaluation. 

4. The polynomial can occasionally be factored into its real linear and 
real quadratic factors—an important simplification in evaluating the roots 
of polynomials. 


9-2 THE REAL ROOTS OF CONTINUOUS FUNCTIONS 

To find the real roots of continuous functions mathematically, we must 
find a number x that, on substitution into a function /(x), results in exactly 
zero. On a pocket calculator it is enough to find neighboring values of x 
that, when substituted into /(x), provide nearly zero results of opposite 
signs. In this case we can approximate the zero of the function by using the 
midvalue of the interval. 

In evaluating the zeros of mixed algebraic and transcendental functions 
the number of zeros is commonly found to be infinite. We must therefore 
identify the region in which the zero that we are interested in occurs. This 
can usually be done quickly by sketching the two functions or analytically 
determining by trial and error the neighborhood of our zero, from which 
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we then begin the search for the root. For example, the function 

f(x)==~ -x(~ + 3]sinx (9-1) 

sinx 2 \7T f 

has an infinite number of roots, as seen in Figure 9-1. The determination 
of which root to evaluate must be left to the analyst. 



Figure 9-1 A function that has an infinite number of roots. 


The simplest method of finding the real zeros of f(x) is bisection. First, 
we identify an interval 

Xj < X < x 2 

such that the product 

(/(*,))(/(*2))«> (9-2) 

That is, on one boundary of the interval f(x) is positive and on the other 
boundary it is negative. Clearly, for the root to exist between the end 
values, this condition must hold true. This is seen in Figure 9-2. Once the 
interval is identified, another interval that satisfies the same property 
(equation 9-2) and is smaller is developed by way of the bisection method. 
That is, we evaluate the function at the midpoint of the interval and test it 
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Figure 9-2 Searching for zeros using the bisection method. 


to verify that the function is zero. If it is not (the usual case), the midpoint 
can be used as one end of a new interval, which will be smaller than the 
previous one. In particular, if the functions evaluated at both the midpoint 
and the original left end point of the initial interval are of the same sign, 
the zero lies to the right of the midpoint and the new interval is the 
previous right boundary and the midpoint. If, however, the function 
evaluated at the midpoint is of opposite sign to that evaluated at the left 
end point, the zero is to the left of the midpoint and the new interval is the 
midpoint and the previous left end point. All of this can be summarized as 
follows: 






JCj + x 2 

——— is a zero of f(x) 
the zero is on the new interval 
the zero is on the new interval 




■> x 2 


Xi + X, 


x v 


(9-3) 


Each iteration halves the length of the interval. Ideally, the initial 
interval should be identified so as to limit the required number of itera¬ 
tions. For example, an interval that is an integer will be reduced one-eighth 
by three iterations, one-sixteenth by four iterations, and one-thirty-second 
by five iterations. Interval reduction of one part in a thousand can be 
achieved with 10 iterations. The general formula is that the interval size 
can be reduced by a factor of 1 /2", where n is the number of iterations. 
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The bisection method raises the issue of how to end the iterative 
evaluation of the root. There are five common techniques. The first and 
most attractive approach is to specify the number of evaluations and then 
decide whether the bisection method is converging slowly enough to 
warrant finding another method. The second method is to test the absolute 
accuracy in x, that is, to determine whether the modulus of the differences 
between solutions of the midvalue is less than or equal to a small number. 
Another approach differs in relative accuracy; we test to see if a modulus 
of the difference in the successive values of x n divided by x w _, is less than 
or equal to £. Still other tests are intended to determine (1) whether f(x n ) is 
less than or equal to some acceptable value and (2) whether the difference 
between successive values of f(x) is less than or equal to some acceptable 
value. 

These methods have all been used on digital computers to stop the 
iterative solutions of /(x) = (). In evaluating roots on the pocket calculator, 
however, tabulated values of x n and f(x n ) can be quickly computed, so that 
the convergence process becomes apparent to the analyst. In fact, the 
analyst generally stops iterating when the law of diminishing returns takes 
over and he sees little improvement in his evaluation of the roots for each 
iteration of the method. 

About the only restriction on the bisection method is that in finding the 
zeros of functions with poles the bisection method locates the pole in a 
manner similar to that used to locate the zero (i.e., for functions where the 
approach to the pole from the right is positive and that from the left is 
negative).This problem will probably not be encountered, since the pocket 
calculator analyst will have at least sketched the function whose root he is 
trying to evaluate, thus knowing the characteristics of the function near the 
root. 


9-3 FALSE-POSITION METHOD 


The false-position method, sometimes called regula falsi , is based on the 
concept that, (a) when decreasing the interval in which the root is expected 
to be found and, (b) when the value of the function at one end of the 
interval is large compared with that at the other end, then the zero can be 
expected to be closer to the end where the function is small than to the end 
where the function is large. Interpolating between the values of the 
function at the end points of the interval, we can thus solve for the point at 
which this straight line passes through zero, using the equation 


y(x)=/(x 1 ) + 


/(•*2)-/(*i) 

x 2 ~x x 


(x-Xj) 


(9-4) 
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The line described by equation 9-4 has the zero at 

= X lf( X 2)~ X 2f( X l) 
f(x 2 )-f(x ]) 


(9-5) 


Having found the new estimate of the location of the roots, we evaluate 
the function at this point and use it to replace the previous end point 
whose function value has the same sign as f(x). At the same time we divide 
the function value of the other end point by 2. 

Figure 9-3 shows the selection of the new estimate of the root, and 
Figure 9-4 illustrates the process by which the interval is halved and the 
approximating line is shaped to permit rapid convergence of the estimation 
of the zero. 



Figure 9-3 Searching for roots using the modified false position method. 


9-4 NEWTON’S METHOD 

In Newton’s method the root is estimated and the tangent line of the 
function is computed at that point. Then the tangent line is projected until 
it intercepts the X axis to determine a second estimate of the root. Again, 
the derivative is evaluated and a tangent line formed to proceed to the 
third estimate of v:. This process is sketched in Figure 9-5. The procedure is 
straightforward, but does involve the evaluation of derivatives. The tangent 
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Figure 9-4 Interval halving to ensure rapid convergence of the false position method. 


line generated in this manner is given by 


y(x)=f(x„)+I'(x n )(x-x n ) 


(9-6) 


which, when> , (x) = 0, gives the recursion formula for iterative estimates of 
the root: 




/(-*„) 

/'(*„) 


(9-7) 



Figure 9-5 Searching for zeros using Newton’s method. 
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Sketches of the well-known cases where Newton’s method encounters 
difficulties appear in Figure 9-6, which emphasizes the analyst’s need to 
sketch the function that he is trying to evaluate. Unless the local structure 
of the function is well understood, Newton’s method should be avoided. 
The advantage to Newton’s method is that, unlike the bisection method or 
the modified false-position method, once it begins to converge on the root, 
it tends to do so very quickly. In fact, at each step it almost doubles the 
number of accurate decimal places in the estimate of the root. Thus with 
an accuracy to 3 places at one step we can expect 6 places at the next step 
and 12 places thereafter. When it works, Newton’s method is excellent. 


9-5 COMPLEX ZEROS 

To find complex zeros of analytic functions, we use the conventional 
complex variable notation—the independent variable is z — x + iy and the 
dependent variable is w=f(z)=f(x + iy), which is equal to u(x,y) + 
iv(x,y). The condition 


f(x + iy) = 0 (9-8) 

is then equivalent to the two conditions 

u(x,y) = 0 (9-9) 

v(x,y) = 0 (9-10) 

Since the equation u(x,y) = 0 defines curves in the complex plane and the 
equation v(x,y) = 0 defines another set of curves, it is only at the intersec¬ 
tion of these two sets of curves that w(z) = 0. In this sense, the problem of 
finding the complex roots of w=f(z) is equivalent to finding the intersec¬ 
tion of the two curves u- 0—v. Obviously, this is the problem of the 
simultaneous solution of two equations. Provided that the zero is not on 
the real axis and that z — x + iy is zero, the conjugate z c — x — w is also 
zero. 

The bisection method can be extended from the problem of finding real 
zeros to that of finding complex zeros. Again, we first find the interval in 
which we can expect to find a zero and then refine the estimate of the zero 
by reducing the interval in which the zero is expected. In the bisection 
method, we first searched the region of the real axis where we expected 
roots to occur, not by evaluating the function numerically, but by deter¬ 
mining the interval during which f(x) changes sign. Then we narrowed that 
interval until we found the location of the root (as close as we wished). In a 
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similar manner we break up the complex plane into regions where we can 
test, not the value of the complex function, but only its sign at suitably 
chosen points in the region and where we can determine the “quadrant- 
number” in which w=f(z) falls. In general, if w—f(z) is not zero (other¬ 
wise we would tabulate a zero at the grid point where it does equal zero), 
the quadrant numbers are defined as shown in Figure 9-7 and as they 
would be located in the complex plane in Figure 9-8. It is apparent that 
where the four quadrants meet we have a zero of w = f(z). Once the grid is 
prepared and the quandrant numbers are written on the grid, the curves 
u = 0 and o = 0 can usually be quickly sketched by keeping in mind that the 
curves w(x,y) = 0 divide quadrants 1 and 2 and quadrants 3 and 4, whereas 
o(jc,y) = 0 divide quadrants 1 and 4 and 2 and 3. Although this method is 
not very sophisticated, it is a convenient way of approximating roots in the 
complex plane on the pocket calculator. We therefore leave the more 
accurate evaluation of the complex roots to methods presented in other 
books. It is worth pointing out, however, that when a complex root is 
identified in this manner the region in the neighborhood of the root can be 
further subdivided to form a refined grid for more accurate root de¬ 
termination. 

An example of the use of this method is finding the complex zeros of the 
function 


w = az + b 

The zeros of this complex function are easy to derive; the example is 
chosen for its pedantic value in illustrating the process of sketching the 
u — 0=u curves to find the region in which the complex zero will occur 
(which can then be used to make the next more refined search for the 


p 


Q = 0 if it or v = 0 
Q = 1 when sign of 
<3 = 2 when sign of 
Q = 3 when sign of 
(3 = 4 when sign of 


{ 

{ 

( 

{ 


u is + 
v is + 

u is — 
V is + 
u is - 
v is - 
u is + 
v is — 


2 1 


3 4 


-> 


u 


Figure 9-7 Quadrant number definitions. 
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Figure 9-8 Quadrant numbers in the complex 
plane. 


complex zero of tv, etc.). First it is instructive to analytically find the zero 
of w. Substituting (x + iy) for z we find 

w = (ax + b) + i(ay ) = u(x,y) + v(x,y) 


Now 


u = 0 = ax + b 



a 


defines the u = 0 curve in the complex plane. Similarly 


or 


v = 0 = iay 


iy = 0 


defines the v — 0 curve in the complex plane. The intersection of these two 
curves is the zero of the function w = 0. This occurs at 


x= _b 

a 


iy = 0 


Now let us examine the use of the modified bisection method to sketch the 
u = 0=v curves for this function. 
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Table 9-1 shows the details of generating the quadrant numbers for 25 
test locations in the complex plane. These locations are the grid points in a 
five-by-five test space: 

x= — 2, —1,0, +1, +2 
y= — 1,0, +1, + 2, + 3 

The calculation of u and v and the analytical determination of the 
quadrant number using ±tan(w/u) are straightforward on the scientific 
pocket calculator, particularly those with rectangular-to-polar conversion. 
The procedure for computing the quadrant numbers is shown in Table 9-2. 

Table 9-1 Quadrant Numbers for the Function u = az + b = (ax + b) + (iby)= u(x, 
y) + b(x, y) When a ~ 1 and b= 1 


X 

y 

u 

V 

Quadrant Number by 
Calculation 

Quadrant 
±tan _1 (u/w) Number 

Quadrant 
Number 
by Inspection of 
Signs of u and t; 

(sign u, sign e) 

-2 

-l 

-1 

- 1 

0 

1 

3 


h>3 

-1 

-1 

0 

-1 

-90° 

0 

(0, -)- 

-+0 

0 

- i 

1 

- 1 

-45° 

4 

( + ,-)- 

-*4 

4- 1 

-l 

2 

-1 

-IT 

4 

( + .-)- 

->4 

+ 2 

-l 

3 

-1 

-18° 

4 

( + ,-)- 

-»4 

-2 

0 

-1 

0 

180° 

0 

(-.0)- 

-*0 

-1 

0 

0 

0 

Undefined 

0 

(0,0)- 

->0 

0 

0 

1 

0 

0° 

0 

(+,0)- 

^0 

+ 1 

0 

2 

0 

0° 

0 

(+,0)- 

-^0 

+ 2 

0 

3 

0 

0° 

0 

(+,0)- 

-*0 

-2 

+ 1 

-1 

+ 1 

135° 

2 

(->+)- 

->2 

-1 

+ 1 

0 

+ 1 

90° 

0 

(0,+)- 

->0 

0 

+1 

1 

+ 1 

45° 

1 

(+,+)- 

-*l 

+ 1 

+1 

2 

+ 1 

27° 

1 

(+,+)- 

->1 

+ 2 

+ 1 

3 

+ 1 

18° 

1 

(+,+)- 

-+1 



Table 9-2 Procedure for Computing Quadrant Numbers 


Evaluate u 
Evaluate v 


-1 

Reverse-Polish with 
Rectangular-to-Polar 


Algebraic Reverse-Polish Conversion 

(v) (v) (v) 

- Enter Enter 

(“) ( w ) (m) 

+ / 

/ / _1 R->P 





^output. 


(3) -^mental step done 
by operator. 
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Here H(9) is given by 


2=1 

if 0° < 0 < 90° 

2 = 2 

if 90° <9< 180° 

2 = 3 

. f ( 180° < 9<270° 

I -90° > 9> - 180° 

II 

O) 

. f [ 270° <0 060° 

1 0° > 0 > —90° 

o 

II 

O) 

if 0 = 0,90,180,270, or 0 


Though computing the quadrant number requires only a few key strokes 
on the scientific calculator, and is a systematic analytical procedure, 
quadrant numbers are more quickly determined by inspecting the signs of 
u and v. Quadrant number determination by calculation is shown here 
more for the sake of completeness than utility. 

Figure 9-9 shows the array of quadrant numbers located at their respec¬ 
tive test points. Note that the line separating the quadrants 1 and 4, and 2 
and 3, is the v = 0 line, while the line separating the quadrants 1 and 2, and 


2 0 111 


2 0 111 


2 0 111 


— 0 - 0 - 0 - 0 - 0 —=►* 


3 0 4 4 4 


Figure 9-9 Quadrant numbers and root location for complex function w = az + b where a = 1 
and b= 1. 
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Figure 9-10 The lines u = 0 and v =0 sketched from the matrix of quadrant numbers shown 
in Figure 9-9 for the complex function w = az + b (a — 1, b — 1). 


3 and 4, is the u = 0 line. These lines are sketched in Figure 9-10. While 
w = az 4- b is a simple function whose zero is easy to determine, the 
procedure is identical for complex functions of more complicated forms. 
Hamming has worked a transcendental equation that illustrates this 
method very well. Figure 9-11 shows the array of quadrant numbers for the 

4 

4 4 4 3 3 

1111144444433333 

I 

1 11 1 1 44444333333 

1 111 1 1 4443333333 

22111 14433332222 

2211111433222222 

I 

22011 11442222222 

22211111 11111111 
2222111111111111 
— 0-0-0-0-0-<j>-0-0-0-0-0-0-0-0-0- 

Figure 9-11 Quadrant number array associated with w— e z — z 2 . 
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function 


w~e z - z 2 

in the neighborhood of x = iy = 0. By inspection of the quadrant number 
field it is possible to sketch the « = 0= v curves, as seen in Figure 9-12. 



Figure 9-12 The curves u=0=o sketched from the matrix of quadrant numbers shown in 
Figure 9-11 for the complex function w = e z — z 2 . 


9-6 AN IMPROVED SEARCH METHOD 

This unsophisticated approach to identifying the complex roots of a 
function has one major fault. The evaluation of the quadrants for each grid 
point requires a number of calculations to determine points that do not lie 
near the zeros of the complex function and thus give relatively little 
information about the location of the zeros. An alternative is to track the 
m = 0 curve and to identify, by marking the spot where this curve crosses 
the u = 0 curve, the region where the root will exist. The approach is to 
search a « = 0 curve in a counterclockwise direction in the area we are 
examining. The u = 0 curve will be indicated by a change from quadrant 1 
to 2 (or 2 to 1) or from 3 to 4 (or 4 to 3). When we find the curve, we track 
it until we meet the t) = 0 curve. The u = 0 curve will be indicated by the 
appearance of a new quadrant other than the two that we were using to 
track the u = 0 curve. 

The procedure is straightforward and requires little practice to learn to 
conveniently track the w = 0 curve. In fact, the analyst learning to develop 
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this method should take known u and u functions and practice tracking 
them until facility is developed in tracking unknown w = 0 curves. 


9-7 PROBLEMS IN DETERMINING THE ZEROS OF POLYNOMIALS 

In Chapter 2 we discussed methods for computing the roots of algebraic 
equations of up to the fourth order. The methods presented in this chapter 
permit the evaluation of the roots of higher-order polynomials. Poly¬ 
nomials are treated as special functions in determinining the zeros of 
functions because much is known about their zeros. The fundamental 
theorem of algebra guarantees that an /?th-order polynomial will have 
exactly n roots. Once a zero is found, it can be “divided” out of the 
polynomial, thus reducing the order of the polynomial to a simpler form: 


(z-*l) 


r-M 


Also, all polynomials with real coefficients (those studied here) can be 
made up of linear and quadratic factors. If a polynomial is of odd order, at 
least one factor is linear and one root (at least) is real. Thus we must find 
the real root and divide it out of the polynomial. When the polynomial has 
been reduced to an even-order polynomial whose roots are either pairs of 
complex conjugate roots, pairs of real and equal roots, or pairs of real but 
unequal roots, we “scale” the polynomial so that the roots lie in a region 
that can be conveniently tested for the presence of additional real roots 
(they occur in pairs). When all real roots are identified, we merely need to 
find the complex roots for the remaining quadratic factors of the even- 
order polynomials. 

Even when numerical methods are available for determining the roots of 
polynomials, we still encounter significant difficulty in the computing 
aspect of this task. To solve linear differential equations with constant 
coefficients, commonly the indicial equation is developed and solved for 
the characteristic roots of the system. These roots are then used in the 
assumed solution function to determine the solution of the differential 
equation. If the characteristic roots of a second-order differential equation 
are both real and equal, the solution should take the form 

y = e~ k *(c l + c 2 x) 

If, however, any numerical error enters into the pocket calculator 
evaluation of the roots, two real and slightly unequal roots of the form 
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result 


rootj = - k + e 
root 2 — —k — e 

then the solution of the equation becomes 

y = C] e- {k+t)x + c 2 e- {k - c)x 

which is unlike the dynamics of a process with two real and equal roots. In 
this example, the purpose was to evaluate the characteristic roots of the 
differential equation, and even the smallest error affects the dynamics of 
the resultant solution to the differential equation. In another case we may 
be interested in the values of x and y that satisfy a system of simultaneous 
equations where small errors in the zeros result only in small errors in the 
evaluation of the simultaneous solution to the simultaneous equations. 

All this serves to illustrate the following important point: the problem of 
finding the zeros of a function means different things to different people. It 
is necessary to define precisely what is required of the analyst when trying 
to evaluate the roots of an equation (to find the zeros of a function). 

Hamming further brings out this important point by noting that 
theorems in mathematics do not necessarily apply to computing. In 
mathematics the notion of zeros of a function is a simple one, but not so in 
computing. Hence the answers to the question, “What is wanted of the 
zeros of a polynomial P(x)T ’ are usually as follows: 

1. Those values of x i that make |/ > (x,)| small should be as accurate as 
required. 

2. At the values x t the polynomial P{xf) should be as small as required. 

3. The polynomial may be constructed as accurately as required from 
the zeros. 

4. The zeros satisfy the auxiliary conditions for roots of polynomials (see 
Chapter 2). 

What is required for one problem may not be required for another. 
Generally, it is thought that (1) is the answer, but actually (3) or (4) is 
really what is usually sought in applied analysis (as shown here in the 
differential equation example). 


9-8 REFERENCES 
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CHAPTER 10 


STATISTICS AND 
PROBABILITY 


10-1 INTRODUCTION 


The determination of the statistics of finite data populations is the topic of 
this chapter. For very large data populations probability is emphasized. 
Since the formulas used in statistical analysis are uncomplicated and for 
the most part directly implemented on the pocket calculator, we focus on 
statistical analysis more than on analysis on the pocket calculator. The 
objective is to provide the pocket calculator owner with a classical basis of 
scientific statistical analysis. However, “tricks” of the pocket calculator 
trade will be mentioned whenever applicable. Also, when there are a 
number of alternative ways to compute a statistic, emphasis is placed on 
the formulas that are most easily evaluated on the pocket calculator. 

First we discuss the numerical evaluation of the statistics that 
characterize data populations. Emphasis is on measures of central 
tendency, measures of dispersion, data distributions, shapes of data distri¬ 
butions, and the elements of probability. Then we proceed to the concepts 
of sampling and testing. 

Throughout the chapter the focus is on the statistical analysis of groups 
of data and their functional interpretation in engineering and scientific 
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analysis. As in Chapter 6, we emphasize the following: 

1. Understanding statistical analysis on the pocket calculator. 

2. Providing useful formulas and tables of data for statistical analysis. 


10-2 FREQUENCY DISTRIBUTIONS 

A number of definitions and concepts are prerequisite to a discussion of 
the formulas for computing the statistics of data populations. The defini¬ 
tions presented here are working definitions and are not presented in 
abstract mathematical notation. We wish to impart a working knowledge 
of statistical analysis on the pocket calculator, rather than a theoretical 
knowledge of the field of statistics and probability. 

A basic data population is simply a collection of statistics called the raw 
data. Raw data are data that lack organization. Arrays and frequency 
distribution are ways of organizing the data, so that the statistics of a 
collection of data can be determined. 

Arrays are arrangements of raw data in ascending or descending order; 
that is, the data are tabulated starting with the largest number and 
proceeding to the smallest, and vice versa. We say that the range of an 
array is the difference between the largest and smallest numbers in the 
array. 

When large sets of data are stratified into categories and the number of 
elements in the data set belong to each category or class, we form a data 
distribution. This is done by generating a table of data by category or 
class, together with the class frequencies, that is, the number of elements of 
the set of all data belonging to each class. Such a tabular array is called a 
frequency distribution or frequency table. An example appears in Table 
10-1. Here the classes or categories are the intervals of height. Data 
arranged in a frequency distribution are often also called grouped data. 
The term “class mark” refers to the midpoint of a class interval. 

In general, frequency distributions are developed by first determining 
the range of the raw data, dividing the range into a convenient number of 
class samples of the same size, and then determining the number of 
observations that fall into each class interval (this, by definition, is called 
the class frequency). Once the frequency distribution is known, histograms 
can be developed to visualize the frequency distribution. Histograms are 
simply a plot of the frequency against the range of raw data. An example 
of a histogram for the frequency distribution in Table 10-1 is shown in 
Figure 10-1. 
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Table 10-1 Heights of 100 Male Students in a 
University 


Height 

(in.) 

Number of 
Students 

Cumulative 
Number 
of Students 

60-63 

4 

4 

63-66 

18 

22 

66-69 

41 

63 

69-72 

28 

91 

72-75 

9 

100 


The relative frequency of a class is its frequency divided by the total 
frequency of all classes. It is multiplied by 100 to obtain a percentage. 
Plots of relative frequency over the range of the data are called a percen¬ 
tage distribution or relative frequency distribution. 

The cumulative frequency distribution is simply defined as the total 
frequency of all data less the upper class or category boundary of a given 
class interval. The third column of Table 10-1 shows the cumulative 
frequency distribution of the height of the 100 students. A cumulative 
frequency distribution can be plotted over the range of data as shown in 
Figure 10-2. 

Relative cumulative frequency distributions are defined like the 
frequency distributions, and so is their percentage. 



Height (h) ("“in.) 

Figure 10-1 Frequency distribution (number of students in height interval A h). 
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Height (h) (~in.) 

Figure 10-2 Cumulative frequency distribution (cumulative number of students with heights 
less than or equal tc h). 


Were the data to increase without bound, the histogram’s frequency 
distribution and cumulative frequency distribution would be expected to 
be developed with increasingly finer quantitization until smooth curves are 
obtained. The analysis of probability using continuous functions is the 
field of probability analysis. Here we concentrate on the statistics 
associated with finite sized data sets. We examine probability as well, but 
our emphasis is on the statistical analysis of small sized data sets that can 
be reasonably analyzed on the pocket calculator. 


10-3 MEASURES OF CENTRAL TENDENCY 


The mean, median, mode, and other measures of central tendency are the 
statistics of data distributions. Numbers that inform us about the centroid 
of the distributions are called measures of central tendency. There are not 
a few such measures, all called averages. The most common of them have 
already been discussed, but we mention them again in the context of this 
chapter. They are the arithmetic mean, the geometric mean, and the 
harmonic mean. Two other measures of central tendency that are impor¬ 
tant in statistics are the median and the mode. The arithmetic mean is 
defined by the relationship 


X = 


N 




2 * 


N 


N 


( 10 - 1 ) 
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The arithmetic mean can be quickly evaluated, as mentioned in Chapter 3, 
either recursively or by simply summing all of the samples and dividing by 
the total number of samples summed. For calculators with the | 2 I 
function (shown as | M+ | on many calculators) the inclination is to 
perform the arithmetic mean with the key stroke sequence 


ALGEBRAIC KEY STROKE SEQUENCE 


/ 

M + 

(Xi) 

f 

M + 

(X 3 ) 

f 

M + 


(*.) 


f 

M + 

f 

A/—»jc 

□ 


(N) 




R 

X 


for a total of (2N + 6) function key strokes and (5N + 5) data entry key 
strokes (assuming 5 digits for every data entry), totaling (7N + 8) key 
strokes. With the straight arithmetic sum, however, the key stroke sequence 
is 


• ■ ■ pr|(x„)p](Af)p] 


and involves only ( 6N + 6 ) key strokes, thus saving 15 to 20% in key 
strokes. The benefits of the 2 or M + functions are only accrued when the 
sum 


2/(*> 2/(*,), 


7=1 


(/(*,)**,-) 


is computed. Then the intermediate calculation for evaluating the/(A)) can 
be conducted without disturbing their accumulation. 

If certain numbers occur more than once, in particular with frequencies 
/,,/ 2 , ...,/„, then the arithmetic mean is defined by 


_ K K 2 /^ 

x-'ZfjXj/'Zf ,-—- 

7=i Zf 


2fx 


N 


( 10 - 2 ) 


When the weighting factors are associated with certain numbers to 
emphasize or change their importance in the distribution, the weighted 
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arithmetic mean is defined as 


2 wx 



This is called the weighted arithmetic mean. 

The median of an array of numbers (i.e., numbers arranged in order of 
increasing or decreasing size) is defined to be the middle value of the array. 
If the array has an odd number of elements, there is a single middle value. 
If the array has an even number of elements, there are two middle values, 
in which case the median is defined as the average of the two middle 
values. 

From a histogram viewpoint, the median is that value of the data range 
which exactly divides the histogram into two equal parts. The mode of a 
distribution is that value which occurs most frequently. 

In general, distributions can have more than one mode, but only one 
mean and one median. 

For unimodal distributions that are only slightly asymmetrical (skewed) 
the mean, median, and mode are approximately related by the relation 

mean - mode = 3(mean — median) (10-4) 

The distinction between the mean, median, and mode is shown in Figure 
10-3. 



Figure 10-3 Mean, median, and mode of a typical symmetric distribution. 
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Another frequent measure of central tendency is the harmonic mean 
defined by the relation 


1 = N 


(10-5) 


From time to time the mean of the logarithm of a set of numbers a; is to 
be computed; then the geometric mean of the numbers is given by 

G = (X v X 2 ---X n )' /n (10-6) 


The recursion formulas for computing the geometric and harmonic 
mean are presented in Chapter 3. Computing the harmonic mean using the 
2 (or M+ ) function on the scientific pocket calculator is an excellent 
example of how this function saves key strokes. The key stroke sequence is 


ALGEBRAIC KEY STROKE SEQUENCE 



as opposed to 


(*,) 


/ 

\/x 

f 

| X-*. 

M 

(X 2 ) 

3 

1 / X 

1 + 1 

/ 

M-+X 

R 

f 

X—>M 












{*„) 

31 

\/x 

□ 

f 

M^X 

□ 

E 

j(A') / 

\/x 

□EJ- 




The number of key strokes with the 


M + 


operator is (6A+11) and 


(8A + 16) without. As much as 33% fewer key strokes are required when 
the 


M + 


function is available on the keyboard. 
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In reverse-polish the sequence of key strokes is 


REVERSE-POLISH KEY STROKE SEQUENCE 



(*.) 


1/x 




\/x 


H 


for a total of (7A + 7) key strokes. 

As mentioned in Chapter 3, the relation between the harmonic, arith¬ 
metic, and geometric means is 


H<G<X 


(10-7) 


when all of the numbers used for calculating these means are identical. 

Another mean, used extensively in this chapter, is the root mean square. 
It is a set of numbers defined by the relation 


RMS = 



( 10 - 8 ) 


The evaluation of the root mean square on many scientific calculators is 
simplified somewhat by the use of the M + x 2 function where the contents 
of the display register (X) are squared and added to whatever is in the 
memory register. 


10-4 MEASURES OF DISPERSION 

Dispersion is defined to be the distribution or spread of the data about the 
average value. It is also frequently called the variation of the data. The 
number of measures of dispersion or variation of data about the mean is 
almost as large as the numbers of great statisticians. Here, we are con¬ 
cerned with those that can be used in practical analysis, such as the range, 
mean deviation, and standard deviation of the data. 
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The range of the data was defined earlier. The mean deviation or average 
deviation of a set of numbers is defined by the relations 


mean deviation = MD = 


N 

2 \x r x\ 

j -1 


N 


2\X~X\ 

N 


= \X — X\ =avg(|A'-A'|) 
(10-9) 


The standard deviation is denoted by s and is defined by the relations 


7=» _ 

N - 1 


_ 2 1 / 2 

1/2 (2(x-x) 


N — 1 


2* 2 /-zz 

— ~y(X— X) 


N 


( 10 - 10 ) 


Occasionally the standard deviation is written in the “biased-estimate’' 
form: 


l(x-x) 


2 1/2 


N 


( 10 - 11 ) 


for N > 30. 

Finally the variance of a set of data is defined as the square of the 
standard deviation. A better form for computing the standard deviation 
from the pocket calculator viewpoint is 


5 = 


2* 2 /2* 


2-! 1 / 2 


N 


N 


- 2 l /2 

= [* 2 -^] 


S /^ 2 / 2 /* 


( 10 - 12 ) 


N 


N 


The standard deviation has a number of useful properties for computa¬ 
tional analysis. Thus for normal distributions (we discuss them later in this 
chapter) 68.27% of all the cases are included between the mean ± 5, that is, 
68.27% of all cases reside within one standard deviation of the mean. 

Similarly, 95.45% of all the cases lie within 2s of the mean and 99.73% of 
all the cases lie within 3 s of the mean. 
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If two distributions have total frequencies N V N 2 , their variations and 
$2 can be combined according to the relation 


N Y si + N 2 sj 
N l + N 2 ' 


[10-13) 


when both distributions have the same mean (not an infrequent case). The 
generalization of this formula to n distributions is straightforward when it 
is noted that the variance of n distributions with the same mean is simply 
the weighted arithmetic mean of the individual variances, where the 
weighting factor is the frequency of each distribution. 

As with absolute and relative error, we can quote absolute and relative 
dispersions. A measure of relative dispersion is given by 


relative dispersion = 


absolute dispersion 
average 


(10-14) 


If we use the measure of absolute dispersion to be the standard deviation 
and the average to be the mean, we can define the coefficient of variation to 
be 



(10-15) 


Clearly, the coefficient of variation is useless for zero-centered symmetric 
distributions. 


10-5 MEASURES OF DISTRIBUTION SHAPE 

Before going into measures of skewness and kurtosis, we must define the 
moments of a distribution. The rth moment of a distribution consisting of 
n values is given by 

N 

2*; v*' 

X '-^ - N ( 10 -‘ 6 > 

The first moment where r— 1 is the arithmetic mean. For a nonzero mean 
we can further define the rth moment about the mean as 

N 

= (X-X) (10-17) 
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Similarly, the rth moment about any origin A is defined by 

N 

2 

7=1 


^ (x _ A y 2d r 


= 


= (X-A) (10-18) 


N N N 

The moments for group data can be defined in a similar manner as 


N 

IfjXf 




2jx' 


m = 


N N 

'Lf(x-x) 

N 


(10-19) 


( 10 - 20 ) 


I,f(X-A) r 

m'r -—- 


( 10 - 21 ) 


Again note that these formulas are in forms immediately useful for 
evaluation on the pocket calculator without the need to rewrite them. 

The degree to which a distribution is asymmetric is specified by the 
skewness of the distribution. That is, if the distribution has a longer tail to 
the right of the distribution centroid, the distribution is said to be skewed 
to the right and to have a positive skewness. Conversely, if the distribution 
has a longer tail to the left of its centroid, the distribution is said to be 
skewed to the left and to have a negative skewness. The mean tends to lie 
on the same side of the mode as the longer tail for skewed distributions. A 
measure of symmetry due to Pearson is therefore given by the difference 
between the mean and the mode. This difference is then divided by the 
standard deviation (to make the measure dimensionless) to form a measure 
of the skewness. Thus Pearson’s measure of skewness is 


, mean-mode A-mode 

skewness =-:———:—:— =- (10-22) 

standard deviation s 

Kurtosis is a measure of the degree to which the distribution is peaked. 
A common measure of kurtosis is 


m 4 m 4 



(10-23) 
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That is, the fourth moment is divided by the fourth power of the standard 
deviation. For Gaussian distributions, the measure of kurtosis is 3. Kurto- 
sis is then sometimes defined by the relationship 

a 4 — 3 = kurtosis 

The moment of kurtosis is also referred to as the coefficient of excess. 


10-6 PROBABILITY 


Now that we have developed the basic definitions in statistics, we can 
examine the elements of probability, which we later relate to statistics 
through concepts in sampling. We will then be ready to discuss informa¬ 
tion theory, decision theory, the elements of nonparametric statistics, and 
concepts of correlation. For our work here we define probability as 
follows. If an event can happen in h ways out of n equally likely ways, the 
probability of occurrence of this event is given by 

p = Pr[E} = ^ (10-24) 

we also say that this is the probability of success of the event; similarly 
q=\—p is equal to the probability of failure of the event. Clearly, p + q 
must equal 1. 

If E x and E 2 are dependent events, that is, the probability that E 2 occurs 
given that E i has occurred is nonzero, we say that the probability E 2 will 
occur given E ] is the conditional probability of E 2 given E x and is written 


Pr{£ 2 |£,) 


Pr{£ 2 andF,} 
Pr(£.) 


(10-25) 


Now, the multiplication law or the law of compound probabilities (i.e., the 
probability that both E x and E 2 occur simultaneously) is given by 


Pr{F 1 F 2 } = Pr{F 1 }Pr{£ 2 |F 1 } (10-26) 

and the addition law of probability (i.e., the probability that either E x or E 2 
occurs) is given by 


Pr{ £, + E 2 ) = Pr{ £,} + Pr{ E 2 ) - Pr{ E X E 2 ) 


(10-27) 
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If the events are mutually exclusive, then 

Pr{ £, + E 2 ) = Pr{ E l } + Pr{ E 2 ) (10-28) 

since Pr(£’,£’ 2 ) = 0. If the events are statistically independent, then 

Pr{£ 1 £’ 2 } = Pr{£,}Pr{£ 2 } (10-29) 

since Pr(£ 2 |£,) = Pr(£ 2 ). 

Clearly, for events that occur discretely, such as rolling dice and flipping 
coins, we can form a discrete probability distribution by tabulating the 
event and the probability that it will occur. An example appears in Table 
10 - 2 . 


Table 10-2 Sum of Points on a Single 
Throw of Two Dice 


Sum of 
Points 
= X 

Pr(X) 

Cum Pr(X) 

2 

1/36 

1/36 

3 

2/36 

3/36 

4 

3/36 

6/36 

5 

4/36 

10/36 

6 

5/36 

15/36 

7 

6/36 

21/36 

8 

5/36 

26/36 

9 

4/36 

30/36 

10 

3/36 

33/36 

11 

2/36 

35/36 

12 

1/36 

1 


Also, discrete probability distributions can be plotted like histograms. 
Cumulative probability distributions can be developed in the same way as 
our cumulative relative frequency distributions. Then, as the number of 
probable events approaches infinity, the probability distribution functions 
become increasingly dense until we can form continuous probability distri¬ 
bution as the limit to which the discrete probability distribution 
approaches as the quantitization in the process goes to zero. 
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A concept commonly used in probability is that of expectation. Expecta¬ 
tion is defined as follows. If X denotes a discrete random variable that can 
take on values X x ,X 2 ,...,X k with associated probabilities p { ,p 2 ,...,p k , 
where p x +/> 2 + ' ’ * + Pk = U the expectation of X is defined as 

E(X)-'2p j Xj-'2pX (10-30) 

y-i 

The extension of equation 10-30 to continuous distribution is obvious. In 
this case, we would define E(x)— ja(mu), the mean of the population; m 
(the mean of the sample) is an estimate of the true E(x). 

It should be clear from this discussion a very large random sample of 
size N from a population would result in a sample mean that is very near 
the population mean, whereas a sample mean based on a small sample 
would not be likely to be very near the population mean. 


10-7 PROBABILITY DISTRIBUTIONS 

Of the many distributions used in probability and statistical analysis three 
are frequently encountered in engineering and scientific work: the 
binomial, the Gaussian, and the Poisson. 

The binomial distribution is defined by 


p(X) = »C xP x q 


X„ N - X _ 


N\ 


X\(N-X)\ 


p x q N ~ x 


(10-31) 


where p is the probability of success, that is, the probability that an event 
will happen in any single trial; q is the probability that it will fail to 
happen in any single trial, usually called the probability of failure and 
equal to 1 —p\ and p(X) is the probability that the event will happen 
exactly X times in N trials. Here X is defined only on the integers, that is, 
A = 0,1,2,...,A. By definition this is a discrete probability distribution. Its 
name reflects the fact that as X takes on integer values from 1 through N, 
the corresponding probabilities are given by the terms in the binomial 
expansion 


(p + q ) N = q "+ N c,p q "-'+ N cy<i N - 2 +---+p» (10-32) 
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The statistics associated with the binomial distribution are developed in 
many books and will not be repeated here. Four are usually used in 
practical analysis, the mean, defined by 


li = Np 


(10-33) 


the variance defined by 


a 2 = Npq 


(10-34) 


the coefficient of skewness defined by 


q-p 

* 3 ~(W /2 


(10-35) 


and the measure of kurtosis (or coefficient of excess) given by 


^ , 1 - 6 pq 

a 4 = 3 T —- 

Npq 


(10-36) 


As an example of the application of the binomial distribution and its 
statistics, let us consider 100 flips of an unbiased coin. The probability is 
one-half that the coin will be heads and one-half that it will be tails. The 
mean thus is 


p = Np 


p= 100x^ = 50 


The standard deviation is given by the square root of the variance: 


o = V Npq 


a= VloOx =V25 =5 
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Skewness is zero, since p = q, and the measure of kurtosis equals 


a 4 = 3 + 


1 -6pq 
Npq 


«4 = 3 + 


( 1 - 1 ) 

25 


a 4 — 3 50 

a 4 ^3 

We see that the expected number of heads in 100 flips of a coin is 50. 
The standard deviation for 100 trials is 5. We would not expect the 
distribution of heads and tails to be skewed (if the coin is unbiased), but 
would expect it to be approximately Gaussian (the kurtosis of a Gaussian 
distribution equals 3). 

The Gaussian distribution is defined by the equation 

1 (x-fiY 

(10-37) 

aV 

where 


l± = the mean 

a = the standard deviation 

The Gaussian distribution has the following characteristics: 

1. The area bounded by the distribution and the X axis is identically 
equal to 1. 

2. The area bounded by the distribution and the X axis in the interval 
between x = a and x = b where a < b is identically equal to the probability 
that X lies between a and b. The Gaussian distribution is shown in Figure 
10-4. 

The parameters of the Gaussian distribution are the mean and variance, 
defined by 


mean = p 
variance — a 2 
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-«—68.27%-^- 

-95.45%- 

-99.73%- 


Percent of area enclosed 
by y and the abscissa 

1, 2, and 3 

Figure 10-4 Gaussian distribution. 


The skewness of the Gaussian distribution by inspection is zero. The 
measure of kurtosis for the Gaussian distribution is given by 

a 4 = 3 

As mentioned before, the binomial and normal distributions are related. 
If for a binomial random variable X,N, and P are not zero nor near zero 
and N is large, the binomial distribution can be approximated with the 

Gaussian distribution, since (X — Np)/vNpq is approximately normal 
with 0 mean and unit (1) variance. It turns out that the Gaussian distribu¬ 
tion is the limiting form of the binomial distribution as N approaches 
infinity. In practical work the Gaussian distribution is a reasonable 
approximation of the binomial distribution if both Np and Nq are greater 
than 5. 

Another probability distribution encountered in practical probability 
work is the Poisson distribution, defined by 

(10 ' 38) 

Here X is a constant of the distribution. 

The statistics of the Poisson distribution are all given in terms of the 
single parameter X. The mean is given by 

I u = X 


(10-39) 
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The variance is given by 

ct 2 = A (10-40) 

which is identically equal to the mean (an interesting curiosity of the 
Poisson distribution, which we discuss later). The coefficient of skewness 
of the Poisson distribution is 


*3=-^ (10-41) 

Vx 

and the measure of kurtosis is 

0 4 = 3 + i (10-42) 

A number of things become apparent by examining the Poisson distribu¬ 
tion. First, it is discrete and its basic properties are not a function of the 
number of trials being considered. Second, the larger the mean (A), the less 
the skewness and the more the distribution tends toward the Gaussian 
distribution; accordingly the coefficient of excess approaches 3. From our 
observations of skewness and kurtosis we might expect that the Gaussian 
distribution is the limit to which the Poisson distribution approaches as A 
approaches infinity; this, in fact, turns out to be the case. 

As might be expected, the binomial and Poisson distributions are re¬ 
lated, both being discrete and both approaching the Gaussian distribution 
as a limit for large numbers samples (in the case of the binomial distribu¬ 
tion) or a large value of the mean (in the case of the Poisson distribution). 
Note that in the binomial distribution if N is large and the probability p of 
occurrence of an event is close to zero, q = { 1 —p) is close to 1 and we say 
that the event is a rare event. In these situations the binomial distribution 
can be closely approximated by the Poisson distribution by letting A = Np. 
Comparison of the mean, variance, skewness, and coefficient of excess 
when A = Np, p = 0, and qsz\ shows that the binomial distribution proper¬ 
ties (e.g., mean, variance) are approximately equal to those of the Poisson 
distribution. Extension of the Poisson distribution to its association with 
the Gaussian distribution follows through its association with the binomial 
distribution. After a little algebra, it can be shown that the Poisson 
distribution can be approximated by the Gaussian distribution, since 

(*-*) 

vx 

is approximately normally distributed with zero mean and unit (1) variance. 

It is common in statistical analysis to use these distribution functions to 
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model the distribution of populations being studied. The approach is to 
determine the mean and standard deviation of the sample of the popula¬ 
tion to estimate the mean and standard deviation of the population. The 
modeling process is then tested for goodness of fit, using a number of 
approaches. One that we discuss later in this chapter is the chi-square test. 
Clearly, modeling a sample of a population with the binomial or Poisson 
distribution solely amounts to determining the mean value (X) of the 
sample distribution. For modeling the binomial distribution we compute 



Then 


q=\-p 

A crude test is to determine if 

Nqpzzs 2 

where s is the standard deviation of the sampled population. If the 
calculation shows that 


/>« 0 

then the Poisson distribution may better fit the data. 


10-8 SAMPLING 


The study of sampling deals with the determination of statistics (estimates 
of distribution parameters) associated with a sample drawn from a popula¬ 
tion and the population parameters themselves. We are concerned with 
sampling insofar as it is related to estimation, testing, and statistical 
inference. First we wish to define a sample, in particular a random sample. 
Generally it is not sufficient to sample a population in a systematic way. 
Since the objective is to develop the statistics of the sample and infer 
something about the parameters of the distribution, the sample that is 
representative of the population must be chosen. There are a number of 
methods for sampling a population, including stratified and random sampl¬ 
ing. One universally accepted way of sampling a population so that the 
characteristics of the population are represented in the sample is random 
sampling. Random sampling is a process in which each member of the 
population has an equal chance of being included in the sample. Typically 
a number is assigned to each member of the population, and the numbers 
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are then scrambled, so that each sample of the population has the same 
chance of being selected as any other sample. 

There are two concepts of sampling: with and without replacement. The 
concepts are straightforward. Sampling with replacement allows each 
member of the population to be chosen more than once, while sampling 
without replacement does not. The analysis is useful because a finite 
sample with replacement can theoretically be considered infinite. Sampling 
without replacement will result in a statistic that takes into account the size 
of the sample compared with the total size of the population. 

For any given sample of size N a value of a given statistic can be 
computed for that group and, if another sample of the same size is 
selected, in general a different value of the statistic is computed. This 
process can continue (with or without replacement) until all statistic values 
are in a population different. Clearly, if a population is of finite size and is 
sampled with replacement, an infinite number of values can be determined 
for each statistic. By organizing these values of the statistic, we obtain a 
distribution of the statistic itself, which is called its sample distribution. 
Clearly, then, there are sampling distributions of the mean, sampling 
distributions of the standard deviation, sampling distributions of the 
variance, sampling distributions of the measure of kurtosis, and sampling 
distributions of the coefficient of skewness. Of all these possible statistics 
we focus on those that help us in testing, estimating, and statistical 
inference only. 

Suppose that we have a population of finite size, say N p . Also suppose 
that a sample of size N is drawn without replacement. Then if we denote 
the mean of the sample distribution of the mean by 

Px 

and the standard deviation of the sample distribution of the mean as 

a x 

and, further, if we define the population mean and standard deviation by ju 
and a, respectively, we can write 


Vx = l l 


ov = 


Vn 


N p -N 
n p -i 


1/2 


(10-43) 


If the population is infinite or if it is finite but sampled with replacement, 
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the population mean and standard deviations are related to the sample 
mean and standard deviations according to 






<7 

Vn 


(10-44) 


For N greater than 30, the sample distribution of the mean is approxi¬ 
mately Gaussian with mean 


and standard deviation 


/^Gaussian t^X 


^Gaussian ^X 


It is noted that the distribution of the mean of a population sample is 
independent of the distribution population. 

Suppose now that we draw a sample from each of two populations. The 
number of samples drawn from the first population is N x and that from the 
second population is N 2 . Now let us compute a statistic s v We find that 
each statistic has a sampling distribution, whose mean and standard 
deviation we denote by (i S] and a . A similar situation holds true for the 
second population; that is, it has a mean and standard deviation given by 
li S2 and ct^. If we now consider all possible combinations of these samples 
from the two populations, we can obtain a distribution of differences, that 
is, S { — S 2 , which is called the sampling distribution of the differences of 
the statistics. The mean and standard deviation of this distribution are 
defined by equation 10-45: 

/As,-s 2 = As, ~ /As 2 



(10-45) 


If S x and S 2 are the sample means from two populations, the sampling 
distribution of the differences of the means is given for infinite populations 
with mean and standard deviation and j a 2 ,a 2 : 


fbc', - x 2 ~ l i x 1 M x 2 ~ Mi ^2 


1/2 




°X ] -X 2 " : ( a ^i + a ^2) 


1/2 


(10-46) 
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These results hold for finite populations if sampling is with replacement. 
The standard deviation of a population can also be computed from a 
sample of the population. It, too, has a distribution with mean and 
standard deviations 


/ M4— A*I \ 
\ 4^2 ) 


(10-47) 


For populations that are normally distributed this reduces to (ju 2 = a 2 and 
^ 4 = 3a 4 ): 


fx s = a 


o 

a= - 

V2 N 


(10-48) 


10-9 STATISTICAL ESTIMATION 

We have just seen how statistical information can be computed from the 
data of the sampled population. One of the key problems in statistical 
inference is that of estimation of the population parameters (mean, 
variance, kurtosis, etc.) from the corresponding sample statistics. Before 
proceeding to the concept of confidence interval estimates for population 
parameters, the key issue in this section, we must clear up the issue of 
biased versus unbiased estimation. In computing a mean value one merely 
sums the numbers of the sampled values and divides by the total number. 
In computing the standard deviation, however, it is important to recognize 
that it takes at least two points to compute a variance; hence the mean of 
the variance of a distribution must be divided by N—\, not N. In this 
sense, then, we have an unbiased estimator. As N becomes large, the effect 
of biased estimation is clearly not significant. For small sample sizes, 
however, it does matter. 

We now estimate the confidence interval of population parameters. The 
idea here is to sample the population, then compute a mean and standard 
deviation for it, and try to infer what this tells us about the mean and 
standard deviation of the population. If we define }i s and o s as the mean 
and standard deviation of the sampling distribution of a statistic S, then 
we can expect the sampling distribution of S to be approximately Gaussian 
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(assume that N > 30) and the actual sampled statistic S lying somewhere in 
the interval /x, - a, to /x, + a„ or ju,-2a, to /x,+ 2a„ or n s -3a s to fi s + 3a,— 
about 68.3%, 95.5%, and 99.7% of the time, respectively. 

In other words, we can be confident of finding /x, in the interval S — a, to 
S+a s about 68.3% of the time; or in the interval S —2a, to 5 + 2a, about 
95.5% of the time; or in the interval between S'-3a, to S + 3a, about 99.7% 
of the time. We can say that we are 68.3% confident that the mean lies 
somewhere in the interval S±o s , that we have 95.5% confidence that the 
mean lies somewhere in the interval S±2o s ; and that we have 99.7% 
confidence that the mean jx, lies somewhere in the interval S±3o s . The 
relationship between confidence levels and a levels is tabulated in Table 
10-3. 


Table 10-3 Confidence Levels 
Associated with a Levels 


Confidence Level 

(%) 

ct Level 

^c 

50 

0.6745 

68.27 

1.0000 

80 

1.28 

90 

1.645 

95 

1.96 

95.45 

2.00 

96 

2.05 

98 

2.33 

99 

2.58 

99.73 

3.00 


The formula for computing the confidence interval associated with the 
mean is given by 


If the sampling is from an infinite population or from a finite population 
but with replacement, the confidence limits in the estimate of the mean are 
specified by 


X±z 


C 


O 

Vn 


(10-49) 
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if sampling is without replacement from a population of finite size N : 

I n - TV \ 1/2 

"°' !01 

The equations for confidence intervals for differences and sums of two 
statistics S { and S 2 are given by 

(S l — S 2 ) ± z c o Si _ S2 — (S { — S 2 ) ± + ajj (10-51) 

(S l + S 2 )±ZcO Si + S2 = (S l + S 2 )±Zc (a* + a] J V2 (10-52) 

provided that the distribution of S , and S 2 is approximately Gaussian. 

By way of example, note that the confidence limits for the difference of 
two populations means in a case where the populations are infinite or are 
finite but with replacement are given by 


(S, - X 2 ) ± = (X, — X 2 ) ± z c ( ^ J (10-53) 

The confidence interval for standard deviations of a normally distributed 
population is given by 

S± z o s = S± (10-54) 

V2N 

Occasionally we need to reference probable error; we define it here but 
retain the concept for reference. The 50% confidence limits of the popula¬ 
tion parameters corresponding to a statistic S are given by S ± 0.6745a 5 . 
This quantity is known as the probable error of the estimate and may be 
worth memorizing. 


10-10 SAMPLING IN THE SMALL 

Earlier in the book we made use of the fact that there are simplifications in 
the formulas for computing a statistic when N > 30. Here we are concerned 
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with cases when N is substantially less than 30, and the emphasis is on the 
determination of the statistics associated with small samples and on the 
distribution of those statistics. Specifically, we consider here the students 
/-distribution and the chi-square distribution. 

The students /-distribution is defined by 


t=-—L-V n^T = — ~ (10-55) 

5 j/V/V 

If we consider samples of size N selected from a Gaussian distribution with 
mean ju and if we compute / given the sample mean and sample standard 
deviation s, the sampling distribution for / can be obtained. This distribu¬ 
tion is given by 


Y = 


(l + / 2 /(/v-l)) 


N/2 


1 + 




2 , (*'+ l)/2 


(10-56) 


Here T 0 is a constant depending on N and is such that the area under the 
/-distribution is 1. The constant ^ == (A^ — 1) is called the number of degrees 
of freedom. This distribution is called “students” /-distribution. Note that 
for large values of v or 7V(7V>30) the curves closely approximate the 
Gaussian distribution: 


Y = 


—1 _ e ~‘ 2 /2 

V2tt 


(10-57) 


We can define the 95 and 99% confidence intervals by either computing 
the confidence intervals on the pocket calculator or using a table of 
/-distributions. Specifically, if — t 0975n + / 0 975 are the values of / for which 
2.5% of the area lies in each tail of the /-distribution, then a 95% 
confidence level for / is 

yr _ 

- ' 0.975 < < < 0.975 ( 10 - 58 ) 

Clearly, then, /a is expected to lie in the interval 




.975 


VjV-1 


< ju < X + / 0 


975 ( VW^l ) 


(10-59) 


with 95% confidence. 
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In general, we can represent the confidence limits for population means 
by 


X±L 


Va^T 


(10-60) 


10-11 CHI-SQUARE 

We now proceed to the chi-square random variable, which is defined by 

2 Ns* (■ X,-X) 2 +{X 2 -Xf+---+{X N -X ) 2 

X = — =- 2 - (10-61) 

o o 

The chi-square distribution is defined by 

Y = Y 0 (x 2 )' /u "~ 2) e ~ ( * : / 2 ) = YoX r-2 e -x‘/2 ( 10 - 62 ) 

Here v - N — 1 is the number of degrees of freedom and T 0 is a constant 
depending on v such that the total area under the curve is 1. The 
chi-square distribution corresponding to various values of v are shown in 
Figure 10-5. As was done with the normal and /-distributions, we can 



Figure 10-5 Chi-square distribution for various degrees of freedom. 
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define 95 and 99% or other confidence limits and intervals for chi-square 
by using a table of the chi-square distribution. In this manner we can 
estimate, within specified confidence limits, the population standard devia¬ 
tion a in terms of the sample standard deviation j. If X 0.025 an d X 0.095 ^ ave 
the values of x 2 for which 2.5% of the area lies in each tail of the 
distribution, the 95% confidence interval is 

„ Ns 2 

Xo. 025 ^ 2 ” ^ X 0.975 (10-63) 

o 

We can see that o is estimated to be in the interval 


with 95% confidence. 




Vx 0.975 


< a < 



(10-64) 


10-12 CHI-SQUARE TESTING 

The chi-square test is probably the most accepted test for determining 
whether there is significance between the observed and the expected in 
sampling a population. There are better tests. Once familiar with the 
chi-square test—the only one discussed in this book—the student and 
infrequent statistical analyst may wish to go on and examine the other tests 
as well. 

It is only reasonable to expect that the results of a sampling of the 
population will not agree exactly with expected values. In fact, it is 
possible to partition a sample into a set of possible events E v E 2 , E 3 , 
E A ,...,E n , and then tabulate the observed frequency with which these 
events occur and the expected frequency from expectation analysis. For 
example, the distribution function for a particular experiment might be 
.expected to be a Poisson distribution. After some hand analysis the 
expected value of the parameter X in the distribution function is identified 
and the distribution itself is drawn. Then the interval of the random 
variable can be partitioned into subintervals and the probability that the 
random variable will occur in these intervals is computed on the basis of 
the expected distribution. These expected frequencies can then be tabu¬ 
lated for each event, as shown in Table 10-4. After completing the table by 
filling in the probabilities or frequencies of the observed events we are 
ready to perform the chi-square test. Specifically, we are ready to deter¬ 
mine whether the observed frequencies differ significantly from the ex¬ 
pected frequencies. 
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Table 10-4 One-Way Classification Table 


Event 

Observed 

Ex 

£2 

£3 

E k 

Frequency 

Expected 

°\ 

°2 


o k 

Frequency 


c 2 

e 3 

e k 


A definition of the difference between the observed and expected 
frequencies is given by the statistic x 2 : 


(o,-e,) (o 2 -e 2 ) 

X =-+-+ ... + 

e l e 2 


(10 . 65) 


j= i J 


Here the total frequency is N : 


N = '2o J = '2 ej 


( 10 - 66 ) 


An alternate form of equation 10-65 commonly found in the literature is 

of 

X 2 = Z~-N (10-67) 

e J 

Note that when the observed and expected frequencies agree, x 2 “ 0* 
Obviously, the greater the difference between the two, the greater x 2 - The 
sample distribution of x 2 can be closely approximated by the x 2 distribu- 
tion 


Y = V ^ - * 72 (10-68) 

when the expected frequencies get larger than 7. In fact, as the expected 
frequencies increase, the distribution more and more closely approaches 
the x 2 distribution. Here the number of degrees of freedom is given by 

v = k -1 (10-69) 

when the expected frequencies can be computed without having to deter¬ 
mine the x 2 distribution parameters from the sample statistics themselves, 
and the number of degrees of freedom 17 is given by 

v = k— 1 — m (10-70) 

if m population parameters are determined from the sample statistics. 
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The table of observed and expected frequencies can be extended to a 
table of k columns and h rows in which the observed frequencies occupy h 
rows and k columns. These are commonly called contingency tables. 
Corresponding to each observed frequency there is an expected or theoreti¬ 
cal frequency. The x 2 is then computed for the entire matrix, using the 
formula 


X 


2 




(10-71) 


Where the sum is taken over all the elements of the matrix made up by the 
contingency table. The sum of all observed frequencies is noted by N and 
is equal to the sum of all expected frequencies. As with the one-way 
classification table (Table 10-5) the distribution of the statistic for the h X k 
table is given very closely by the x 2 distribution provided that the 
frequencies are not too small. 

The number of degrees of freedom of this x 2 distribution for k > 1 and 
h > 1 is given by 


v = (h-\)(k-\) (10-72) 

if the expected frequencies are not computed from the population sample. 
If they are, the degrees of freedom are given by 

v — (h — 1)(& — \) — m (10-73) 

when m population parameters are determined from the sample statistics. 

Usually expected frequencies are computed on the basis of a hypothesis 
we are trying to test. If the computed values of x 2 are greater than some 
critical significance levels, we can conclude that the observed frequencies 
differ significantly from the expected frequencies. In this case we would 
reject the hypothesis being tested at the test level of significance. For 
example, if the value of x 2 were to exceed the Xo. 95 > we would say the 
hypothesis is rejected at a 95% level, that is, there is a 5% chance that the 
rejection of the test could be wrong. Conversely, there then is a 95% 
chance that we are correct in rejecting the hypothesis. 

In a similar manner, we would expect that, when x 2 is approximately 
zero, the observed frequencies are in too close agreement with the expected 
frequencies. Obviously, the approach here is to also reject those cases that 
we consider to be in too close agreement, that is, when x 2 is less than the 
expected value of x 2 95% of the time, such as Xoos- 
The procedure we have just described assumes that we are going to 
compare results from frequencies determined with continuous distributions 
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but applied to the discrete data associated with the finite samples. Yates 
has developed a correction factor to account for this difference. The 
corrected value of x 2 is given by 


X corrected 


(|oi-e,| —0.5) (\° 2 ~e. 


+ 




(10-74) 

This correction is made only when 17 = 1. For large samples, the corrected 
X 2 approaches the uncorrected x 2 - The reason for the correction is that the 
uncorrected x 2 can lead to significant errors near the critical values, that is, 
the low probability values of the x 2 distribution. For small samples where 
the expected frequency is on the order of 7, it is better to use the corrected 
value of x 2 - In fact, some authors suggest the use of testing with both the 
corrected and uncorrected values of x 2 ; if both lead to the same conclu¬ 
sion, the conclusion then is said to be unambiguously redefined. 

There are a number of simple formulas for pocket calculator determina¬ 
tion of x 2 that use only the observed frequencies. For the x 2 testing table 
shown in Table 10-5, x 2 can be computed as follows: 


x 2 = 


N(a l b 2 -a 2 b l Y 


Nh 2 


( a i + bi)(a 2 + b 2 ){cii + a 2 )(b , + b 2 ) N\N 2 N A N B 


(10-75) 


where A = {a x b 2 - a 2 b x ). With Yates’ correction 

2 _ jV(|A|-A72 ) 2 

X corrected N,N 2 N A N g 


(10-76) 


Table 10-5 A 2x2 Table of Observed 
Frequencies 


Event 

Group 

Event 

I II 

Total 

A 

a \ 

*2 

N a 

B 

*1 

b 2 

n b 

Total 


» 2 

N 
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Similarly, for a 2 X 3 table, x 2 is given by 


2 N 


a\ a\ a\ 
—— H + — 

N x N 2 N 3 


+ 


N 


b\ b\ b\ 

—— H-— H-— 

N i A, A, 


-A (10-77) 


Example 10-1 If 20% of the transistors produced by a process are defec¬ 
tive, using the binomial distribution determine the probability that out of 
four transistors chosen at random, one, none, and at most two will be 
defective. Since the probability of a defective transistor is 

P = 2 

and of a nondefective transistor it is 

q~ \ — p = -8 

then 


probability (1 defective transistor out of 4) 

= 4 C, (.2 ) 1 (. 8) 3 = 77^ZT)7 (- 2 )‘(- 8) 3 = -4096 

probability (0 defective transistors) = 4 C 0 (.2)°(.8 ) 4 = .4096 
probability (2 defective transistors) = 4 C 2 (.2) 2 (.8 ) 2 = .1536 

Thus 

probability (at most 2 defective transistors) 

= probability (0 defective transistors) 

+ probability (1 defective transistor) 
+ probability (2 defective transistors) 
- .4096 + .4096 +1536 = .9728 


Example 10-2 The mean weight of 500 engineers is 170 lb, and the 
standard deviation is 15 lb. Assuming the weights to be normally distri¬ 
buted, find how many engineers weigh between 139 and 175 lb and how 
many weigh more than 185 lb. Weights recorded as being between 139 and 
174 lb can actually have any value from 138.5 to 174.5 lb, assuming that 
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they are recorded to the nearest pound. 

138.5 lb in standard units = — -2.10 

174.5 lb in standard units = — -0.30 

Then the number of engineers between 139 and 174 lb is (Figure 10-6) 

(area between z = - 2.10 and z = 0.30) = (area between z = —2.10 and z 

= 0) + (area between a = 0 and z = 0.30) = 0.4821 + 0.1179 = 0.6000 

The number of engineers weighing between 139 and 174 lb is 500(0.6000) 
= 300. 



Figure 10-6 Percent of engineers weighing between 139 and 174 lb. 


Example 10-3 Find the probability of obtaining between 3 and 6 heads 
inclusive in 10 tosses of a coin by using the binomial distribution and the 
normal approximation of the binomial distribution. 


Probability 



Figure 10-7 Discrete binomial distribution. 
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The binomial distribution gives (Figure 10-7) 

probability (3 heads) = 10 C 3 (^) 3 (i) 7 = ^ 
probability (4 heads) = 10 C 4 (i) 4 (i) 6 = 
probability (5 heads) = 10 C 5 (j) 5 (i) 5 = & 
probability (6 heads) = i0 C 6 (i) 6 (i) 4 = 

Thus 

probability (of getting between 3 and 6 heads inclusive) 

= 15/128+ 105/512 + 63/256+ 105/512 = 99/128 = 0.7734 

The probability distribution for the number of heads in 10 tosses of the 
coin is shown graphically in Figure 10-8. The required probability is the 
sum of the areas of the cross-hatched rectangles and can be approximated 
by the area under the corresponding normal curve shown dashed. 


Probability 



Number of heads 

Figure 10-8 Data treated as continuous normal distribution. 


Using the normal distribution, 3 to 6 heads can be considered as 2.5 to 
6.5 heads. The mean and variance for the binomial distribution are given 
by 

H= Np = 10(i) = 5 

and 


o=\/Npq =V(10)(i)(i) =1.58 
2 5 — 5 

2.5 in standard units = -- - = - 1.58 

1.58 
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and 

6.5 in standard units = -- =0.95 

1.58 

The probability of getting between 3 and 6 heads inclusive in the area 
between z = — 1.58 and z = 0.95 under the normal distributipn curve is 

(area between z = — 1.58 and z = 0) + (area between z = Q and z = Q.95) 

= 0.4429 + 0.3289 = 0.7718 

which compares very well with the true value 0.7734 obtained using the 
binomial distribution. 


Example 10-4 Measurements of the diameters of a random sample of 200 
bearings made by a certain machine showed a mean diameter of 0.824 in. 
and a standard deviation of 0.042 in. Find (a) 95% and ( b) 99% confidence 
limits for the mean diameter of all the bearings that will be made by this 
machine. 

The 95% confidence limits are given by 


Vn 

which is approximately 


X ± 


1.963 

Vn 


In this example, X = 0.824 in. and 3 = 0.042 in. Thus 

0.824 ± l %( °j 4 j | 

W200 / 


0.824 ± 0.0058 in. = (0.8182,0.8240) 

with a 95% confidence interval. 

The 99% confidence limits are given by 


X± 


2.583 

V77 


which is numerically equal to 


/ 0.042 \ 

\ V200 / 


0.824 ±2.58 



276 


Statistics and Probability 


Thus 

0.824 ± 0.0077 in. = (0.8163,0.8317) 

with a 99% confidence interval. Note that we have assumed the reported 
standard deviation to be the unbiased standard deviation s. If the standard 

deviation had been s, we would have used s = s}/n /(N — 1) 
= V200/1995 , which can be taken as 5 for all practical purposes. 


Example 10-5 A small sample of 10 measurements of the length of a bolt 
give a mean of A = 4.38 in. and a standard deviation of 5 = 0.06 in. Find 
the 95% and 99% confidence limits for the actual length. 

The 95% confidence limits are given by 


Since 


X±t 


0.975 


VjV-1 


we can find 


v = 9 


^ 0.975 - 2.26 

from Table 10-6 or directly using the definition of the t distribution. 

Using X = 4.39 and 5 = 0.06, we can be 95% confident that the actual mean 
will be included in the interval 

4.38 ± 2.26| a - 6 - - ) = 4.38 = 0.0452 in. 

\ VTO^T / 

The 99% confidence limits are given by 

X±t 0 J— =^=r) 

Where v = 9 and / 0995 = 3.25. Then with 99% confidence we can expect the 
actual mean to be included in the interval 

4.38 ± 3.25( - - - - - I = 4.38 ± 0.0650 in. 

WlO-1 / 


Now we work this problem, assuming that large sampling methods are 
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Table 10-6 Percentile Values ( t p ) for the Single-Sided 
t-Distribution with v Degrees of Freedom 


V 

* 0.995 

* 0.99 

* 0.975 

0.95 

* 0.90 

1 

63.66 

31.82 

12.71 

6.31 

3.08 

2 

9.92 

6.96 

4.30 

2.92 

1.89 

3 

5.84 

4.54 

3.18 

2.35 

1.64 

4 

4.60 

3.75 

2.78 

2.13 

1.53 

5 

4.03 

3.36 

2.57 

2.02 

1.48 

6 

3.71 

3.14 

2.45 

1.94 

1.44 

7 

3.50 

3.00 

2.36 

1.90 

1.42 

8 

3.36 

2.90 

2.31 

1.86 

1.40 

9 

3.25 

2.82 

2.26 

1.83 

1.38 

10 

3.17 

2.76 

2.23 

1.81 

1.37 

11 

3.11 

2.72 

2.20 

1.80 

1.36 

12 

3.06 

2.68 

2.18 

1.78 

1.36 

13 

3.01 

2.65 

2.16 

1.77 

1.35 

14 

2.98 

2.62 

2.14 

1.76 

1.34 


valid, and compare the results of the two methods. Using large sampling 
methods, we obtain the 95% confidence limits 

X± = 4.38 ± 1 .961 ) = 4.38 + 0.037 in. 

Vjv V Vio / 

where we have used the sample standard deviation 0.06 as an estimate of a. 
This is to be compared with 4.83 ±0.0452 using small sample statistics. 
Similarly, the 99% confidence limits are 


X± =4.38±2.58( 

y/N VVio 


= 4.38 ±0.049 in. 


as compared with 4.83 ±0.0650. 

In each case the confidence intervals obtained by using the small or 
exact sampling methods are greater than those obtained by using large 
sampling methods. The reason is that less precision is available with small 
samples than with large samples. 




278 


Statistics and Probability 


Example 10-6 Test scores of 16 students from one city showed a mean of 
107 with a standard deviation of 10, while the same test scores of 14 
students from another city showed a mean of 112 with a standard devia¬ 
tion of 8. Is there a statistically significant difference between the mean 
scores of the two groups at a 0.01 and a 0.05 level of significance? 

If jiij and | u 2 denote population mean scores of students from the two 
cities, we will see if we can reject the hypothesis 

H o : Vi = V 2 

That is, there is no statistically significant difference between the groups. 

The reader can easily show for himself that in the case of comparing two 
sample sets 

oVT/N. + i/N, 

where 


Then 




sf + N 2 $2 


+ n 2 -2 


1 16(10) 2 +14(8) 2 
16+14-2 


= 9.44 


- 112 lll? - --1.45 

9.44VT/16+ 1/14 

On the basis of a two-tailed test at a 0.01 level of significance, we would 
reject H 0 if t were outside the range —/ 0 995 to ^ 0 . 995 - For + N 2 — 2) 
degrees of freedom = (16+14-2) = 28, this range is —2.76 to 2.76. It 
follows that we cannot reject H 0 at a 0.01 level of significance. 

We would reject H 0 on the basis of a two-tailed test at a 0.05 level of 
significance if t were outside the range — 1 0975 to / 0975 . For 28 degrees of 
freedom this range is —2.05 to 2.05. Again we cannot reject H 0 at a 0.05 
level of significance. 

We conclude, therefore, that there is no significant difference between 
the scores of the two groups. 


Example 10-7 The standard deviation of the heights of 16 male students 
chosen at random in a school of 1000 male students is 2.40 in. Find 95% 
and 99% confidence limits of the standard deviation for all male students 
at the school. 



Chi-Square Testing 


279 


The 95% confidence limits are given by sV# /X 0975 and jViV /xoois- 
For 

1^=16 — 1 — 15 degrees of freedom 

Xo 2 975 = 27.5 
or 

X0.975 = 5-24 

and 

X0.025 = 6-26 
or 

X0.025 = 2.50 

Then the 95% confidence limits are 2.40 Vli5 /5.24 and 2.40 V 16 /2.50, 
that is, 1.83 and 3.84 in. Thus we can be 95% confident that the population 
standard deviation lies between 1.83 and 3.84 in. 

The 99% confidence limits are given by 

jVA^/xq.995 and sV~N /Xo.oos- 
For 

^=16 — 1 = 15 degrees of freedom 

Xo.995 = 32.8 
or 

X0.995 = 5*23 

and 

X0.005 = 4.60 
or 


X 0.005 2.14 

Then the 99% confidence limits are 2.40V16 /5.73 and 2.40V16 /2.14, 
that is, 1.68 and 4.49 in. 

Thus we can be 99% confident that the population standard deviation 
lies between 1.68 and 4.49 in. 

The number of degrees of freedom of a statistic are denoted by rj, which 
is defined to be the number of independent observation samples minus the 
number of population parameters that must be estimated from the sample 
observations. 
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CHAPTER 11 


THE PROGRAMMABLE 
POCKET CALCULATOR 


11-1 INTRODUCTION 

As mentioned in the preface, the premise of this book is that the pocket 
calculator provides the analyst with a new dimension capability. The 
programmable pocket calculator, in the author’s opinion, is yet another 
advance in pocket computing capability for the scientific analyst. 

From the analyst’s viewpoint, the most significant use of the pocket 
calculator may turn out to be as a teaching machine. The usual approach 
to learning a new discipline or a new technology involves four steps: 

1. Studying the discipline in the textbook fashion. 

2. Identifying or developing the mathematical tools that have been 
useful for solving the discipline’s problems. 

3. Working the “textbook” problems to learn the details and subtleties of 
the discipline through quantifying the problems with numbers, tables, 
graphs, and drawings. 

4. At least two to three years of application of the mathematical models 
by working in the discipline itself. 

The last step in this process takes one from the textbook-type knowledge to 
actual knowledge of the discipline itself. Here, the mathematical models 
are usually more complex, requiring many subtle considerations. For 
example, Fourier analysis as studied in the textbooks is often confined to 
Fourier series and Fourier integral representations of continuous functions 
defined on the entire domain of the reals. In practical harmonics analysis, 
the functions are usually finite in length and the data are usually 
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sampled, requiring “window carpentry” filtering of the data prior to 
conducting the harmonic analysis. These “practical problems,” while dis¬ 
cussed in textbooks, are rarely given the consideration that they require in 
actual data handling problems. 

With the advent of the programmable pocket calculator, the analyst now 
has libraries available for many disciplines other than his own. These 
libraries have been developed by persons highly experienced in their 
disciplines. By merely securing the standard library for a specific disci¬ 
pline, the analyst who has a programmable pocket calculator can, in a 
matter of weeks, become familiar with the programs and mathematical 
tools used in the discipline. Having acquired experience with practical 
programs, he can then focus his attention on the problems in his discipline 
and how to use the programs to solve them. In a sense the learning process 
is reversed. The analyst begins with the ability to numerically evaluate 
problems with which he is only vaguely familiar. These are practical 
problems, however, and involve mathematical models that have passed the 
test of time in practical analysis. Under the guidance of a person ex¬ 
perienced in this new discipline, the learning process is fast. 

The programmable pocket calculator is a good teaching machine also in 
that it is portable and the learning can be conducted in the comfort of 
one’s own home. In the past, the numerical evaluation of most practical 
engineering problems was usually done on either a digital computer or a 
programmable desk-top calculator at work (provided that the analyst could 
justify his request for a budget to run the computers). Now the analyst can 
study even the more complex aspects of any given problem or discipline at 
home, where most of us do our homework anyway. Furthermore, the 
analyst learns more quickly now because he spends most of his time 
thinking and deriving, with a minimum of effort (stroking the key strokes) 
on numerical evaluation. 

The next most important capability that the pocket calculator brings to 
the scientific analyst is the iterative computation of numbers and the 
preparation of extensive tables and graphs (involving many point pairs) for 
a more extensive set of problems than could be handled on the nonpro¬ 
grammable pocket calculator. For the many consultants and small en¬ 
gineering organizations that do not have a computer facility, the pro¬ 
grammable pocket calculator can bring to each member of the staff 
tremendous computing power. 

Scientists and engineers usually do not compute, but develop the formu¬ 
las that are used to compute the numbers and thus provide the insight to 
solve problems. The pocket calculator allows the analyst to begin with a 
top-level mathematical model of his process and refine it very quickly by 
testing the model numerically. Here he develops a system of equations that 
he thinks will describe a process or solve a problem and uses the pocket 



Hardware Considerations 


285 


calculator to numerically evaluate the equations so that the results can be 
compared with what is observed about the system. The analyst judges the 
degree to which the model can satisfactorily predict the behavior of the 
process. Any major discrepancies lead to revisions and improvement in the 
model. Thus, while Newton’s laws and Lagrange’s equations generate the 
equations of motion of a process that, when solved, will predict its 
behavior, the numerical comparison between the actual observed behavior 
and the predicted behavior may indicate that certain elements are left out 
of the models. In the Newtonian formulation of mechanics this would lead 
to more comprehensive free body diagrams to better understand the 
system and thus to derive better mathematical models. In the Lagrangian 
formulation of mechanics it would lead to the development of a more 
refined Lagrangian which would have more energy terms to account for 
the additional elements in the system. 

We see, then, that the pocket calculator does not improve the “ method ” 
for generating the equations of motion but helps to improve the mathema¬ 
tical model to which the methods are applied. This development of 
mathematical models using numerical testing is a convenient and fast 
operation with the programmable pocket calculator. There is no waiting 
for a batch-processed computer run to be made to get the data for 
improving the model. There are no “charge numbers” or budget required 
to permit the analyst to use the computer. On the programmable pocket 
calculator the cost of the run is in the “noise” of the electric bill. Finally, 
when an acceptable mathematical model is developed, the analyst can 
transfer the model to a magnetic tape strip and store the model for future 
use—a convenient means for conserving the energy spent preparing the 
mathematical model. Furthermore, the key data used in the analysis of a 
problem can also be stored on the magnetic tape for future reference. 

Thus the programmable pocket calculator also provides an effective 
means of documenting an analysis. The analyst can collect a magnetic tape 
library at relatively small expense, requiring minimal storage space at 
relatively low cost. 

Finally, the programmable pocket calculator provides the engineer with 
portable low-cost computing power for use in the field, in his car, at his 
customer’s location, or in the convenience of his home. 


11-2 HARDWARE CONSIDERATIONS 

The programmable pocket calculator has the following parts: 

1. The arithmetic unit, that is, the combination of registers that perform 
the arithmetic. 
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2. The memory, which stores numbers and instructions as programmed 
from the keyboard or from stored programs on magnetic tape. 

3. The firmware associated with the calculator, that is, its “hard wired” 
programs and instruction set that are already built into the calculator. 

Calculators perform numerical calculations only, as opposed to compu¬ 
ters, which are alpha-numeric data processors. Today’s programmable 
calculator, and any that might be expected in the near future, will only be 
limited in the fact that they perform numerical calculations and not 
alpha-numeric operations. Apart from this, the pocket calculators are 
similar to digital computers. Specifically data "can be stored in memory, 
recalled to the arithmetic registers or arithmetic unit, and processed and 
restored in memory following the sequence of preprogrammed operations. 
An essential and interesting difference between the typical calculator and 
its digital computer counterpart is that many calculators operate in dec¬ 
imal rather than binary arithmetic. The reason is that decimal arithmetic 
involves less electronics for the special-purpose calculators than would 
conversion from decimal to binary and back again, as on general-purpose 
digital computing machines. Memories therefore are often set up in integer 
multiples of 10, as is the number of registers in the computing machine. 
For example, certain pocket calculators have one constant storage register 
and four arithmetic registers, in which the register arithmetic is performed. 
Ten additional storage registers are available in an advanced model of the 
basic calculator. The HP-65, has 100 programmable steps that can be put 
into memory. 

The memory in most programmable calculators can be expected to be a 
set of registers in conjunction with the operating stack for performing 
register arithmetic and for scratch-pad storage during the execution of a 
program. From the standpoint of memory for storing numbers, there are 
only the registers in the stack plus the scratch-pad registers for number 
storage and manipulation. For example, if there are 9 scratch-pad registers 
and 4 stack registers, there are 13 total storage locations for storing 
numbers generated by a program. There is, however, memory for integer 
multiples of 10 keyboard instructions. For example, in the HP-65, 100 
instructions can be stored in the calculator for sequential operations. That 
is, a program of 100 key strokes on the keyboard of the machine can be 
stored and executed automatically. Though numbers can be programmed 
into the calculator, using the 100 storable key strokes is relatively in¬ 
efficient. Instead, the numbers can be input into the scratch-pad memory 
directly as opposed to inputting a 13-digit number into memory with 13 
key strokes. This is perhaps the only important distinction between pro¬ 
grammable pocket calculators and the standard digital computer. The 
programmable pocket calculator can be expected to store about 100 to 
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100,000 key strokes, not 13 -digit numbers. Thus when we say that a digital 
computer has 32 k 16-bit words or that a desk-top calculator has 4k 12-bit 
words, for the pocket calculator we say that it can store 100 or so key 
strokes. This might seem somewhat limiting, but actually most pocket 
calculator problems involve fewer than 100 key strokes. With a 100-stroke 
memory capability we can evaluate rather advanced mathematical func¬ 
tions and program fairly sophisticated iterative procedures for solving 
difficult problems. 

The speed with which the pocket calculator processes the 100 instruc¬ 
tions varies from calculator to calculator. From 10 to 1000 instructions per 
second can be expected from present-day pocket calculator electronic 
circuitry. 


11-3 FIRMWARE 

The firmware consists of the instruction set built into the pocket calculator 
and “ called ” from its keyboard. The basic instruction set usually contains 
all of the functions on the keyboard of the scientific calculator and a set of 
special functions associated with the programming aspect of the pro¬ 
grammable pocket calculator. These include the following: 

1. The GO-TO instruction. This instructs the calculator to perform the 
instruction at the nth step in the stored program. Thus GO-TO 50 would 
tell the computer to perform the instruction at the fiftieth step in the 
program. 

2. The JUMP instruction. This instructs the calculator to jump the next 
two steps. It is expected that this instruction will be a natural part of all 
programmable calculators, and the two steps that are skipped are usually 
GO-TO type instructions. Thus the JUMP instruction with the GO-TO 
instruction permits the calculations to be looped iteratively in the com¬ 
puter program. 

3. The DECREMENT AND JUMP ON ZERO instruction. This in¬ 
struction, which can reasonably be expected in programmable calculators, 
examines the contents of one of the scratch-pad storage registers. If the 
register is not zero, it decrements the register by 1 and continues. When the 
register is zero, it will perform the JUMP operation. 

4. The LOGICAL or TEST FLAG instruction. The flag can be set equal 
to 1 or zero, thus controlling the data flow in a calculator program, based 
on whether the flag is 1 or zero. Usually, the test flags or Booleans can be 
set manually on the keyboard or, since it is a keyboard instruction, with 
the program. 
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5. The STOP instruction. This is an instruction to stop the program. 

6. The TEMPORARY STOP or RUN/STOP instruction. The calculator 
is told temporarily to stop, usually for the purpose of data input or data 
output. 

Other keyboard instructions that can be expected to be found on the 
typical programmable pocket calculators are the DELETE function, the 
NO-OP function, and the SINGLE-STEP function. The SINGLE-STEP 
function permits the program to be processed or reviewed a single step at a 
time. This is for the purpose of debugging the program and examining or 
modifying the program by stepping up to the location in the instruction 
sequence that is to be modified or changed and the DELETE function 
instruction used to delete the previously programmed instruction, leaving it 
available for reprogramming. Finally, the NO-OP function can be used to 
fill memory with an instruction not to perform an operation. In this way, 
the remaining steps of a program can be safeguarded against accidental 
programming of the instruction sequence with undesirable program steps. 

The firmware in a programmable pocket calculator can also include a 
keyboard for performing user-defined functions—functions that are pro¬ 
grammed in a normal mariner by a sequence of key strokes telling the 
calculator how to execute the function. Of the firmware just discussed, 
only the latter uses part of the programmable memory; the former func¬ 
tions are part of the keyboard sequence and thus are designed into the 
electronics of the calculator. 


11-4 SOFTWARE 

The software in programmable calculators is usually a magnetic tape strip, 
a magnetic card strip, or a tape cassette that is used to both read in and 
read out data and instructions from- or -to the memory of the calculator. It 
can be expected that manufacturers will provide preprogrammed software 
for performing analysis for many disciplines. In fact, it is precisely this 
software that permits a single pocket calculator to be programmed to 
perform special-purpose calculations in many disciplines. In a discussion 
with the Chief Engineer on the HP-65 Program, Chung Tung, the author 
was informed that it was precisely this motivation that led Hewlett- 
Packard to develop the HP-65, the first of the programmable pocket 
calculators. 

The software associated with any pocket calculator would usually be 
developed so that problems involving more than 100 instruction sets and 
requiring more than the scratch-pad storage provided by the stacks plus 
scratch-pad memory can be programmed on a series of mag tapes or mag 
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card strips. In attempting to see how far this process could be carried, the 
author programmed an 11-card sequence on the HP-65 for executing the 
lateral and vertical channels of an automatic landing system simulation 
which involves the numerical integration of a 14th-order continuous 
dynamic process, including saturation limits and other hard-stop non- 
linearities. While somewhat impractical to use, it does point out the 
flexibility of this method when general-purpose computing machines are 
not available—when the calculation is required in the field or when 
research is being conducted away from the computer center. 

A similar procedure can be used for storing data in excess of the 100 key 
strokes plus the limited scratch-pad storage available in the pocket cal¬ 
culator. 


11-5 PROGRAMMABLE POCKET CALCULATOR TECHNIQUES 

The basic procedure for solving a problem on the programmable pocket 
calculator is as follows: 

1. Definition of the problem. The generic types of problem that are 
conveniently solved on the pocket calculator are data processing (which 
includes interpolation, extrapolation, and filtering), the numerical evalua¬ 
tion of functions, the solution to systems of equations (whether algebraic 
or differential), the simulation of continuous processes, the frequency- 
domain analysis of data, and the statistical analysis of data. All these 
topics are covered in this book. 

2. Preparation of a math flow of the sequence of key strokes required. For 
this the equations for solving the problem must be determined and the 
sequence of key strokes to numerically evaluate the equation must be 
worked out in a form that can be solved explicitly, implicitly, or by a 
combination of both. The preparation of the math flow will, by definition, 
identify the control operations for automatic execution of the key strokes. 

3. Programming of the calculator by keying in the key stroke sequence, 
including control operations. Once the program is stored in memory, it is 
useful to load it onto a mag-tape strip so as not to inadvertently destroy 
the program. It is reasonable to expect that programmable calculators will 
have an ERASE BEFORE WRITE tape load function. Thus reprogram¬ 
ming or redefining the program or modifying the program can be restored 
on the same mag-tape strip or cassette by simply reloading the program on 
the mag-tape strip. 

4. Verification and checking of the program by tests with all numerical 
values set equal to zeros, 1, or a single sequence of numbers that permit 
testing the program and its loops. 
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5. Running the sequence automatically with the actual problem data. 

As an example, consider the problem of analyzing a low-pass filter. 
Figure 11-1 shows the three steps in the mathematical modeling process. 
First, the physical block diagram model of the process is drawn, including 
all of the hardware elements, the system inputs, and the system outputs. In 
this particular case we have a high-gain amplifier with impedance networks 
on the forward and feedback loops, which result in the passage of the 
low-frequency components of the input signal to the output while attenuat¬ 
ing the high-frequency components. The mathematical block diagram of 
this filter in Laplace transform notation is the second step in the modeling 
process shown in the figure. The frequency response of this filter can easily 
be determined by replacing S with jo) and computing the transfer function 
of the filter algebraically. For the more difficult problem of determining 
the time-domain response of the filter to arbitrary forcing functions, it is 
necessary to prepare the differential equation that models this physical 
program. This is the third step in the modeling process shown at the 
bottom of Figure 11-1. The next step is to prepare a math flow for the 
low-pass filter mathematical model. 

The math flow visualizes the way in which the problem is intended to be 
solved on the programmable pocket calculator. As shown in Figure 11-2, 
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Ci — AV — 
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e out 
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Figure 11-1 Low-pass filter mathematical model. S = Laplace transformation operator. 
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Figure 11-2 Math flow of low-pass filter mathematical model. 


the first task is to initialize the problem with the state variables and 
parameters in the problem, together with the coefficients involved in the 
numerical integration process for solving the differential equations of 
motion of the low-pass filter and the control variable that is used to 
determine the computing path within the math flow. In this case the initial 
state vectors are the initial values of the filter input and output, the 
parameters are the resistance and capacitance of the filter, the parameter 
associated with the numerical integration process is the integration step 
size, and the control variable is the number of steps that we will take 
through the system of differential equations to compute the filter’s re¬ 
sponse at a time nT. 

After traversing the initialization path, the calculator is programmed to 
compute the rate of change of motion of the filter output. On computing 
the rate, the next step in the math flow is to compute the new value of the 
filter output, which is then followed by a test to determine if the calcula¬ 
tions are to be stopped. In this example we ask if 100 passes through the 
system of equations have been taken. When the answer is yes, results are 
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displayed in the display register. If the answer is no, another pass through 
the system of equations of motion is made (after renaming the variables 
and computing the forcing function for the next step, of course). In this 
example we simplify the situation by examining only the step response of 
the filter. The calculation of the forcing function in the closed-loop 
feedback process then is not necessary. This has no effect on the general 
nature of this discussion, since computing the forcing function is a 
straightforward process when it is necessary. 

Table 11-1 illustrates the third step in the problem-solving procedure— 
preparing a key stroke sequence that can be programmed on the pocket 
calculator. In this example, the HP-65 calculator was used and some 
attention must be given to the details of its implementation on the HP-65 


Table 11-1 Preparation of a Key Stroke Sequence 


Math Flow 

Key 

Stroke 

Math Flow 

Function 

Initialization Path 

LBL-T 

Label the initialization path A 


R/S 

STO-1 

Stop then input and store y 0 


R/S 

STO-2 

Stop then input and store x 0 


R/S 

STO-3 

Stop then input and store R, 


R/S 

STO-4 

Stop then input and store Rj 


R/S 

STO-5 

Stop then input and store C, 


R/S 

STO-6 

Stop then input and store T 


R/S 

STO-8 

Stop then input and store n 

Identify loop closure points 

LBL-1 

Label this step “1” 

Compute rate 

RCL-2 

Recall x 


RCL-3 

Recall Rj 


RCL-4 

Recall Rj 


+ 

Ri/Ff 


X 

(R,/R f )x 
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to understand this step of the procedure for solving problems on the 
programmable calculators. Though this key stroke sequence was pro¬ 
grammed on the HP-65, it is typical of the key stroke sequences one would 
expect to encounter on most programmable pocket calculators. 

The first 15 steps shown in Table 11-1 are the steps taken along the 
initialization path. The sequence begins by labeling the initialization path 
A to distinguish it from the normal feedback path which begins at step 16 
and is labeled 1 (it identifies the point at which the feedback path loop 
closure occurs). The steps 2 through 15 involve automatically stopping to 
input a number on the keyboard and then manually starting the program 
again to store the keyboard number in memory. For example, step 2 stops 
the automatic program sequence so that the variable Y 0 can be input on 


Table 11-1 ( Continued) 




Math Flow 


Key 


Math Flow 

Stroke 

Function 

Compute rate 

RCL-1 

Recall y 


CHS 

-y 


+ 

{Ri/R f )x-y 


RCL-3 

Recall R t 


RCL-5 

Recall C ; 


X 

R,Ci 


+ 

( R,/R f )x-y 



R,Ci y 

Compute state 

RCL-6 

Recall T 


X 

£ 

II 

3 


RCL-1 

x =y + Ay 


+ 

STO-1 

y n =y n +1 

Test for problem 

DSZ 

Test register 8 for zero. If zero, 

being solved 

Go through 


skip next step and proceed 

equations of 

motion again if not solved 

GTO-1 

If R-8 not zero then go to step “1” and decrement 
R-8 by one and go to 1 

Display results 

R/S 

Display y 


294 


The Programmable Pocket Calculator 


the keyboard and then stored in location 1 (step 3) when the RUN-STOP 
key is again stroked (step 2). When operating, the computer will automati¬ 
cally progress to the first RUN-STOP (which is step 2), will stop, the 
variable F 0 is manually input through the keyboard into the display 
register, and when the RUN-STOP key is stroked, the variable Y 0 will be 
stored in memory register 1 (step 3). The computer will then automatically 
proceed to step 4 where it will stop to await keyboard input of X 0 and the 
associated RUN-STOP key stroke to allow the program to proceed to step 
5 which is to store the contents of the display register in memory register 2. 
This procedure of initializing the program continues until step 15 where 
the loop closure point is identified by labeling that particular step as step 1. 

The next 12 steps compute the filter output rate according to the 
differential equation shown in Figure 11-2. Updating the filter output is 
done in the next five steps of the program. Then a test is made to see if 100 
passes through the equations of motion have been completed. This is the 
step beginning with the key stroke DSZ, which means “decrement and skip 
on zero.” The function of the DSZ is to test register 8 for zero. If register 8 
is zero, the next step in the program will be skipped. In this example if 
register 8 is zero, the calculator will jump over the GO-TO instruction and 
go immediately to the RUN-STOP instruction where the program will stop 
and the latest value of Y will be displayed in the display register. If, 
however, the contents of register 8 are not zero, then the program will not 
skip the GO-TO instruction—it will go to the step labeled 1 and simul¬ 
taneously the contents of register 8 will be decremented by 1. In Hewlett- 
Packard’s implementation of the DSZ function, register 8 is used for the 
contents of the number of steps to be made in an iterative procedure. For 
other pocket calculators, it can be expected that other implementations of 
this DSZ function can be made. They all have one thing in common, 
however: the decrementing of some register and skipping the next step if 
the register’s contents are zero. If it is not zero, the next step will not be 
skipped; however, the contents of the test register will be decremented by 
1 . 

The careful reader will remember that the author recommends terminat¬ 
ing an iterative procedure based on a test of the number of iterations if at 
all possible. Again, the reason is that for many iterative procedures an 
estimate of the number of steps to solve the problem can usually be made. 
This solution often gives insight into the convergence properties of the 
problem, which is helpful in establishing confidence in any result. For 
these problems where estimates of steps that should give the solution are 
known, the DSZ function is a “natural” test procedure and thus is 
particularly important in pocket calculator analysis. 

Table 11-2 shows three static check cases used to check this program. 
Simple static tests of a program can often be made with numbers that are 




Programmable Pocket Calculator Techniques 


295 


Table 11-2 Preparation of a Check Case 

Check Case 1 

Check Case 2 

Check Case 3 

Let 7 0 = 0 

*o-l 

*o=l 

*o-l 

*0=1 

*o = 0 

tf-1 

*,=1 

1 

*/= 1 

*/= 1 

R f =2 

Cf- 1 

C/-1 

C f = 3 

T= 1 

T= 1 

T=2 

n= 1 

n= 1 

n = 1 

Then 7 = 1 

7=0 


AT= 1 

o 

II 

<1 

A7= —f 

n-i 

*■-1 

7 = - 

1 n 3 

Display Y n = 1 


Display Y n = 1 


Display 7„ = i 


quite unlike the physical characteristics of the process being studied. In 
this particular set of check cases, only zeros and 1 were used in the first 
two cases and zero through 3 for the third. It is important to develop 
dynamic check cases by either using an alternate means to solve the 
problem or a predetermined analysis of a simplified version of the prob¬ 
lem. This is not shown as our straightforward example. The material 
covered in Chapter 7 is an example of the numerical methods that can be 
used to generate an independent check case on the dynamics of the 
solution of problems involving this type of differential equation. The idea 
would be to compare the results of a solution generated with a recursion 
formula with those of the solution generated here by using Euler numerical 
integration. 

Table 11-3 illustrates the fifth step in the problem-solving procedure 
where the computer makes 100 passes through the equations to compute 
the filter’s response at 1 second, with the filter design parameters being 
varied. The reader may be interested to know that the material for this 
example was developed and programmed and the sequence of solutions 
was run in approximately 17 minutes. The static check cases were run in 20 
seconds, and the three 100-step solutions were run in approximately 3 
minutes. 

Further illustrations of the power of the programmable pocket calculator 
to solve problems are given in Chapter 12, where optimization problems 
with the penalty function method for handling equality constraints are 
programmed and example solutions are run to exemplify the calculator’s 
use for this type of analysis. 
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Table 11-3 Running of the Automatic Sequence to Study the Unit Step Response 
of the Filter 



Examples of 100-Step Solution Cases at T- 

= 0.01 

Time 

R,= 1 Mi 2 

R t = 2 MSI 

R, = 2 MSI 


Rj= 1 MSI 

R f = 2 MSI 

R f = 1 MSI 


C,= lju/ 

Cj— 1 nf 

c,= 1 M / 

1 second 

y = 0.63396766 

7 = 0.39422956 

7=1.26793532 


It is worth pointing out that as the calculator is programmed, the key 
strokes are displayed according to their row-column location on the 
keyboard. For example, if the key that is at the intersection of the third 
column of keys and the second row of keys is depressed, a “32” is 
displayed in the register window. In this way, the programmer can monitor 
the programming of the process to ensure that the desired program is being 
stored in memory. This is also used in conjunction with the single-step key 
to review a program that is already in memory and, when necessary, to 
single-step up to the point where a change is to be made. 

Relational tests that are not used in this example, but which can be 
expected in programmable pocket calculators, include those of whether a 
register is greater than, equal to, or less than the contents of another 
register. In the Hewlett-Packard 65 implementation, the relational tests are 
conducted in conjunction with the ninth memory register. As mentioned 
before, with the DSZ function it can be expected that other implementa¬ 
tions will be available in other programmable pocket calculators. 

Finally, a point well worth making is that in the preparation of any 
computer program on any programmable calculator (where there is more 
than enough memory for the problem), it is advisable to include additional 
RUN-STOP operations in long programs to display intermediate results 
while writing and checking the problem. When the program is finally 
checked out, the unwanted stops can be deleted. The deletion procedure is 
simply to single-step to the RUN-STOP and then use the DEL instruction 
to eliminate the RUN-STOP instructions used for checkout purposes. 


11-6 METHODS OF ANALYSIS ON THE PROGRAMMABLE POCKET 
CALCULATOR 


There are three basic types of numerical methods for solving problems on 
the programmable pocket calculator. In the explicit method the equations 
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to be numerically evaluated are simply programmed on the calculator, thus 
eliminating the need for manually working out the sequence of key strokes 
to solve the problem. A manual optimization problem is a good example. 
Assume that the top-level cost model for some satellite programs takes the 
form* 


C = n 


30 + 



+ 4 


30 + 


M- 1 
1.5 


30 


research, 

development, 


refurbishment 


+ 0.4 


30 + 


M — 1 
1.5 



T_ 

M 


+ 2 T + 

ground support 


launch costs 

22 T 
M 


tests, and 
engineering 


which involves 69 key strokes for their numerical evaluation and the use of 
four scratch-pad storage locations. The program is tabulated in Table 11-4. 

It is clear from the cost model that the mean mission duration of the 
satellite plays a dominant role in the cost equation. If the mean mission 
duration is small, the number of launches (T / M) is large, and the cost 
associated with each launch results in high total program cost. If the mean 
mission duration is large, the cost associated with the design and devel¬ 
opment of the satellite is large, which also leads to a high total program 
cost. Clearly, somewhere in between is a minimum total program cost. To 
determine it, we use a sequence of solution values for the cost equation, as 
shown in Table 11-4. It is apparent that a satellite mean mission duration 
of — 1.25 years minimizes the total program cost. 

The preparation of Table 11-4 on the programmable pocket calculator 
involved 61 key strokes to program the calculator and 100 key strokes for 
data entry and manual program iteration. The entire procedure took 14 
minutes, including checkout. When the table was manually prepared 
without using the programming feature of the calculator, the table took 
approximately 45 minutes to prepare. While the time saving shown here is 
typical of pocket calculator analysis, what is not shown (but what is 
equally important) is that had the total program cost model given unex¬ 
pected or unexplained results that would have required its modification, 
only 3 minutes would have been required to incorporate the cost model 

*n = number of satellites; 

M = satellite mean mission duration; 

T — total program lifetime; and 
C — total program cost-millions of dollars. 
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Table 11-4 Total Program Cost for the XYZ Satellite Program 


Number of 
Satellites 

Program 

Duration 

(years) 

Mean Mission 
Duration 
(years) 

Total Pro¬ 
gram Cost 
(millions) 

2 

5 

0.25 

650 

2 

5 

0.50 

453 

2 

5 

0.75 

401 

2 

5 

1.00 

385 

2 

5 

1.25 

383 

2 

5 

1.50 

389 

2 

5 

1.75 

399 

2 

5 

2.00 

411 

2 

5 

2.25 

425 

2 

5 

2.50 

440 

2 

5 

2.75 

456 

2 

5 

3.00 

473 

2 

5 

3.25 

490 

2 

5 

3.50 

508 

2 

5 

3.75 

525 

2 

5 

4.00 

544 

2 

5 

4.25 

562 

2 

5 

4.50 

580 

2 

5 

4.75 

599 

2 

5 

5.00 

618 


modifications and to prepare a new Table 11-4. With the programmable 
calculator, the modification would have been reprogrammed only for that 
part of the program where it was required.. The entire program need not 
necessarily be rewritten. Then only an additional 100 key strokes would 
have been needed to prepare another version of Table 11-4. 

The second method of problem solving is the implicit method. An 
implicit equation is prepared and solved as discussed in Chapter 9 on 
determining zeros of a function. The procedure there would be to program 
the iterative procedure so that the solution to the implicit equation satisfies 
an error criteria established by the analyst. 

The final procedure is neither implicit nor explicit. It is simply a 
brute-force search for the solution to an equation or system of equations 
by systematically testing regions where the solution is expected to exist and 
retaining only the value (or values) in the region that best satisfies the 
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equation to be solved. Of the three methods, the latter is the most 
systematic, involving the least number of calculations and taking maxi¬ 
mum advantage of the programmability feature of the pocket calculator. 
The only test that needs to be done is to determine whether the equation, 
when a solution is computed at a test point, is better satisfied with the test 
solution currently being used than with any previous test solutions. If it is, 
the new test point is stored in memory and the old one erased (or retained 
if it is desirable to monitor the convergence of the process). If it is not, the 
systematic search algorithm proceeds to the next test point, retaining the 
best previous test point. Of the iterative implicit and systematic search 
methods, the latter is the least efficient but involves the fewest program¬ 
ming steps, while the former method is more sophisticated, requiring 
logical tests and search algorithms, such as Newton’s method. 

From the analyst’s viewpoint, the explicit mode of computer solution, 
where the analyst is involved in selecting the conditions to substitute into 
the computer program (manual iteration), is at best a gross procedure but 
requires only a few quick iterations, since the manual interaction will lead 
to a closing in on the gross solution fairly rapidly. The implicit method 
results in solutions that are difficult to develop with man-machine interac¬ 
tion because the precision with which the solution is to be determined is 
beyond the level at which the manual interaction can easily guess a better 
solution than a preprogrammed solution search algorithm (see Chapter 12). 

Finally, the third method, while systematic and simple to program, 
results in the least efficient and least accurate solution to the problem. The 
accuracy can be refined through refined grids of possible solution values. 
The technique can be used for finding zeros of complex functions, such as 
those described in Chapter 9 on finding zeros of a function. It is a very 
practical and useful method when only a rough answer is required for a 
problem that takes a lot of key strokes to evaluate. Also, it is mentioned 
here as an example of the simplest form of problem solving available on 
the pocket calculator at a low programming overhead penalty. 



CHAPTER 12 


OPTIMIZATION 


12-1 INTRODUCTION 

No discussion of the programmable pocket calculator is complete without 
consideration of its optimization capabilities. The optimization problem 
has gained significance in engineering in the last few decades because it 
identifies the limit that practical design could approach if resources were 
unlimited. Practical engineering design usually is suboptimum design; the 
optimum is of vital importance nevertheless, since it identifies the ultimate 
design limit and optimum system capability. 

Here we do not cover what is perhaps the key issue in any optimization 
work—the determination of what is to be optimized. Specifying precisely 
the payoff function in any systems analysis is a practical matter. It is 
perhaps the most difficult aspect of all systems analysis in that the 
computational analysis, once the payoff function has been identified, is 
almost a trivial matter in comparison to selecting the payoff function itself. 
In fact,, commonly a number of payoff functions are identified and a 
system is optimized from a number of different viewpoints. The result is a 
group of optimized systems, which are studied to identify the most practi¬ 
cal system. 

Whatever the way in which optimization is applied in systems analysis, it 
is the analyst who must quantify the optimum system, from the 
standpoints of both its characteristics and its payoff. We therefore proceed 
to reexamine the fundamentals of optimization—for only the three simpl¬ 
est optimization problems: the parameter optimization without constraints, 
the parameter optimization with equality constraints, and the parameter 
optimization with inequality constraints. Though the simplest of the op¬ 
timization problems, they are among the most frequently encountered. 
Also, they involve smaller programs than do the more sophisticated op- 
300 
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timization problems and thus can often be handled on the pocket calcula¬ 
tor. For these reasons, then, we reexamine the fundamental concepts of 
optimization to illustrate the optimization process by means of math flow 
and specific problems that can be programmed on the pocket calculator. 

The approach used here is to first develop the mathematical concepts of 
these simple optimization problems and then discuss their numerical 
evaluation. The intention is to reacquaint the reader with the concepts of 
constraints, Lagrange multipliers, and the optimization terminology. 


12-2 MAXIMA AND MINIMA 

In most systems analysis the optimization problem amounts to maximizing 
the payoff function. This function is usually of the form “benefit divided 
by cost.” As a system is developed and increasingly more money is spent 
on it, the benefits usually follow the law of diminishing returns. This is 
seen in Figure 12-1. It is also true that the benefits are usually accrued on a 
discrete basis as fixed amounts of money are spent on the system, rather 
than being continually accrued. It is apparent from Figure 12-1 that the 
benefit-to-cost ratio takes the shape shown in Figure 12-2. It is to the 
analyst’s advantage, therefore, to maximize the cost-benefit curve or the 
payoff function in terms of the benefit-cost ratio. 

From a more mathematical viewpoint, optimization involves either 
maximizing or minimizing a function /(x f ). Specifically the objective is to 
identify those values of that cause/(*,.) to be a minimum or a maximum. 
Strictly speaking, we need only consider either the minimization or the 


System 

benefit 



Figure 12-1 System benefit as a function of system cost. 
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System cost 

Figure 12-2 Cost benefit as a function of system cost. 


maximization problem, but not both. The reason for this is that the values 
of X; which maximize /(.*.) also minimize —/(*,). The maximum of f(x ) 
occurs at the same place as does the minimum of —/(*). We therefore 
discuss the optimization problem from the viewpoint of either extremum, 
but never both. 

Perhaps the most familiar case in optimization is when the extremum of 
a single dependent variable is a function of a single independent variable. 
For a function of one variable, this means finding the point at which the 
derivative is zero and evaluating the function at that point. The value of 
the independent variable where the derivative is zero is only a necessary 
condition that the function be at an extremum; it is not sufficient. For 
example, a function can have a derivative equal to zero at a stationary 
point and not at a maximum or minimum. Thus it is necessary to check the 
second derivative to determine whether it too is at zero (a point of 
inflection). If it is not, the second derivative can be used to determine 
whether the extremum is a maximum or minimum depending on whether 
the second derivative is negative or positive. Thus the condition for the 
extremum of a single variable is 


o 

Jl^ 

(12-1) 

If —,-/(/)> 0, it is a minimum. 
dt l 

(12-2) 

If ■—/(/) < 0, it is a maximum. 
dt 4 ' 

(12-3) 
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And if (d 2 /dt 2 )f(t) = 0, it is a point of inflection. 
For a two-variable function what is required is 



(12-4) 

i^-° 

(12-5) 


Single variable optimization on the programmable pocket calculator was 
discussed in Chapter 11. It is worth mentioning again, however, that f(x ) 
can be conveniently programmed on the calculator and a search (manual 
or automatic) for the x that minimizes f(x ) can be quickly done. For 
complicated f{x), the numerical value of the minimum f(x) may be found 
more quickly on the calculator in this manner than by the analytical steps 
just outlined. 

Optimization of functions of more than one variable force us to change 
notation at this point. In what follows, we use the notation of Bryson and 
Ho*: 


= parameter vector 


( 12 - 6 ) 


We concern ourselves with the parameter optimization problems that 
involve finding the values of the m parameter x v x 2 ,.. .,x m minimizing a 
payoff function that is a function of these parameters. We write the payoff 
function in the Lagrangian notation 

L(x l ,x 2 ,...,x m )=L(x) (12-7) 

The use of the programmable pocket calculator in solving optimization 
problems is presently limited to two or three dimensions, but it is quite 
useful for higher-dimension problems in computing “parts” of the problem 
as subroutines. In any case the pocket calculator permits optimization 
analysis of some complexity. 


*See reference. 
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12-3 PARAMETER OPTIMIZATION WITHOUT CONSTRAINTS 


If there are no constraints on ^ and if the function L(x) has first and 
second partial derivatives, the necessary conditions for a minimum are 


and 


3L =0= 8L 
3jc dXj 



( 12 - 8 ) 


(12-9) 


Here we mean that the matrix whose components are d 2 L/dx^Xj must 
have eigenvalues that are zero or positive. All jc that satisfy dL/dx = 0 are 
called stationary points. When 


!^>0 ( 12 - 10 ) 

dx*■ 

at the stationary points x s , L(x s ) is at a local minimum. If d 2 L/dx 2 = 0 at 
jc = it is not possible to establish whether the point is a minimum. Such 
a point is called a singular point. 


12-4 PARAMETER OPTIMIZATION WITH EQUALITY CONSTRAINTS 

A more general optimization problem is to find the values of the “control 
parameters” that minimize a payoff function 


L(x l ,...,x n ;u l ,...,u m ) (12-13) 

where the n parameters x v ...,x n are determined by 

f l {x l ,...,x n ;u l ,...,u m ) = 0 (12-14) 


(12-15) 
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Now, let 


x = 


= parameter vector 


(12-16) 


= control vector 


(12-17) 


/= 


/, 

/„ 


= constraint vector 


(12-18) 


Then the optimization problem is to find the vector u that minimizes 

L(x,u) (12-19) 

where the vector x is related to u according to the constraint equation 

/(x, w) = 0 (12-20) 

For a given optimization problem, the choice of which parameters to use 
as control parameters is not unique. The choice must be such that u 
determines x through the constraint equation. 

A stationary point is one where dL = 0 for arbitrary du, while holding 
df— 0 (letting dx change as it will). Then we have 

dL — L x dx + L u du (12-21) 

and 

df=f x dx + F,du = 0 (12-22) 

Equation 12-12 may be solved for dx: 

dx=-s:\du 


(12-23) 
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By substitution, then, we have 


dL = (L u — Lj~ '/„) du 


At the stationary point we see that for dL~ 0 for any du 


L u ~LJ~'f u = 0 


(12-24) 

(12-25) 


These equations together with the constraint equations determine the u 
and x at stationary points. 

Another technique is to adjoin the constraints to the payoff function by 
a set of n “undetermined Lagrangian multipliers,” \ 1 ,...,X H , as 

H(x,u,X) = L(x,u)+ 2 XJ i (x,u) = L(x,u) + X T f(x,u) (12-26) 

i = i 


If we choose u (and thereby x through the constraint equations) so that 
L= H, and if we choose X according to 


then 


A r = 


dL 

dx 



-1 


and 



dL= dH= ^--du (12-27) 

du 

Thus dH/du is the gradient of L with respect to u while holding f(x,u) = 0. 
At a stationary point, dL vanishes for arbitrary du\ which can happen only 
if 


dU 

du 






= 0 


du du 

Hence a stationary value of L(x,u ) must satisfy the equations 


(12-28) 


/(*.«) = 0 (12-29) 

^=0 (12-30) 



(12-31) 


H — L(x,u) + X T f(x,u) 


(12-32) 


where 
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12-5 THE GRADIENT METHOD 

When L(x,u ) and/(*,«) are complex, numerical methods must be used to 
determine the values of u that minimize H. Perhaps the most commonly 
used numerical method is that of steepest descent for finding minima. 

Gradient methods are iterative algorithms for estimating u, so as to 
satisfy the stationary conditions dH/du = 0 (see Figure 12-3). 



Figure 12-3 Gradient method. (&u) l = — k(dH } /du), (A») 2 = — k(dH 2 /du). Note: search 
can overshoot when — &(3/f/3u)»0. 


One procedure for using the gradient method is: 

1. Estimate the values for u. 

2. Compute x from f(x,u) = 0. 

3. Compute Afrom \ T = - (dL/dx)(df/dx)~\ 

4. Compute dH/du = (dL/du)+\ T (df/du). 

5. Revise the estimates of u by amounts Aw= - K(dH / du) T (K is a 
positive scalar constant). 

6. Iterate, using the revised estimates of u, until (dH / du)(dH / du) T is 
very small. 
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The gradient method for finding a minimum is a hill-descending technique. 
Starting with an initial guess of w, a sequence of changes Aw is made. At 
each step Aw is in the direction of the gradient dH/du whose magnitude 
gives the steepest slope at that point on the hill. The choice of K involves 
judgment to ensure that the linearized prediction will be accurate and the 
process will be efficient (i.e., will not require many iterations). K will 
usually be varied in the sequence of iteration when it is thought that the 
minimum is near. 


12-6 COURANT’S PENALTY FUNCTION METHOD 

Another numerical method for optimizing with either equality constraints 
or inequality constraints is the Courant penalty function method. Suppose 
that we wish to minimize L{u) subject to 

/(h) = 0 ( 12 - 33 ) 

For the penalty function method, we minimize 

l = l («)+ a :||/(«)|| 2 ( 12 - 34 ) 

subject to no constraints! Here K is large. If L attains a minimum aty 0 , it 

is reasonable to expect that 

/(« o)~0 ( 12 - 35 ) 

and 

f^(«„)~0 ( 12 - 36 ) 

Computationally, the penalty function method is easy to use and under¬ 
stand and has been used with great success in certain parameter optimiza¬ 
tion problems. The Courant penalty function method does not always 
work, however, because large values of K tend to make a long, narrow and 
deep depression in the field of L(w) with the stationary point at the bottom 
of the depression. The problem with this is that the gradient is more likely 
to be evaluated on the sides of the depression than the end of the 
depression. This will result in estimates of the stationary point that jump 
back and forth across the narrow depression instead of running down the 
length of the depression. For example, minimizing 


L = U,-2) 2 +^ 2 


(12-37) 
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subject to^j = 0 has y l =y ; 2 —0. The penalty function method minimizes 


-2) +yj + Ky]=yl + 


(12-38) 


Contours of constant L are ellipses with centers at >^ = (2/1 + K), y 2 -0. 
Figure 12-4 shows contours of constant L. 

Inequality constraints can be conveniently handled by the penalty 
function method as well. The approach is straightforward and illustrated 
by the following example: Minimize L(y) subject to the constraint f(y) 



Figure 12-4 L contours created by large penalty function coefficients. 
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< 0. This problem is solved using the penalty function method by minimiz¬ 
ing 


where P is defined as 


L=L(y) + KPf(y 2 ) 

(12-39) 

p= j 1- (/>0) 

(12-40) 

1 0, (/< 0) 



Examples of pocket calculator optimization with the penalty function 
and gradient methods for programmable pocket calculators follow. 


Example 12-1 Find the stationary value of 



subject to the linear constraint 


f(x,u) = x+ mu — c = 0 


(x is a scalar parameter and a , b , m, and c are constants). 

The curves of constant L are ellipses, with L increasing with the size of 
the ellipse. The line x + mu—c — 0 is a fixed straight line. The minimum 
value of L satisfying the constraint is obtained when the ellipse is tangent 
to the straight line (Figure 12-5). Now 

w= ^(fi + fi) +A (* +mu - c ) 


Thus the necessary conditions for a stationary value are 

a H 


x + mu — c — 0, 


9 * n 2 


- = —r + Am = 0 

a u b 2 


These three equations for the three unknowns, x, u, A, have the solutions 




X = 


u = 


a 2 + m 2 b 2 

b 2 mc 
a 2 + m 2 b 2 


c 


X= — 


a 2 +m 2 b 2 
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Figure 12-5 Example of minimization subject to constraint. 


and the minimum value of L is given by 

L .~ _d_ 

min ? . 2i 2\ 

2(a + m b ) 

Note that 

y dJ dj 

~ X -^ = Tf 


Example 12-2 The gradient optimization method (steepest descent) Con¬ 
sider the problem of minimizing the function 

This problem, as mentioned in the last section of this chapter, is trivial 
from a practical viewpoint but instructive from a pocket calculator op¬ 
timization viewpoint. By inspection we see that L is a minimum aty 1 = 2 
andy 2 = 0. The gradient method is to seek the condition 


0L 

dy 


= 0= VL 


We iteratively solve the implicit equation 




until VL^O. Then y n + x =y n = value of y that minimizes L. For this 
problem 


A yi = 2k 2 (y,-2) 
Ay 2 = 2k 2 y 2 



312 


Optimization 


Then 


>2„,=>'2. + A >’2. 

The key stroke sequence for implementing the gradient optimization 
method on the HP-65 programmable pocket calculator (which is typical of 
the program for any pocket calculator) is shown in Table 12-1. 

This sequence of key strokes was programmed in less than 1 minute. 
Now let us examine the types of numerical analyses that can be made with 
the pocket calculator. First consider the effect of k 2 on the first 10 steps of 
the process of finding the y x and y 2 that minimize L. In Table 12-2 
y x — 1 =y 2 at the start (L 0 — 2). It is apparent that for the case ofy x = 1 —y 2 , 
k 2 = 0.5 leads to the best 10-step estimate of the optimum solution. The 
data for each step can be easily developed once the calculator is pro- 


Table 12-1 Typical keystroke sequence for gradient optimization 


Key Strokes 

Comment 

LBL-A \ 


R/S ! 

Inputs 

STO-1 J 

Store y x in register 1 (R-l) 

R/S ( 

Inputy 2 

STO-2 } 

Input data Store y 2 in R-2 

R/S ( 

Input K 2 

STO-3 \ 

i K 2 -^ R-3 

R/S ) 

| Input N —the number of iterations 

STO-8 / 

A^R-8 

LBL-1 ^ 

Label this step “1” 

RCL-1 

\ Recall y x 

2 

I 


f Oh- 2) 

STO-5 

(yi- 2)-R-5 

/- 1 ' 

\ 

v- | 

' Compute L (y,—2) 2 

RCL-2 

l y 2 

f~ l 

i 

\T 

) y\ 

+ 

1 yl + (y l ~2) 1 = L 

STO-6 > 

L—>R-6 
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grammed. Thus foryj = 1 —y 2 and /c 2 = 0.05 the sequence of solutions is as 
shown in Table 12-3. It appears that when /: 2 = 0.5 the first step accidently 
puts the estimates ofy, andy 2 right on the stationary points. This is seen 
by noting thaty, andy 2 are exactly - 1 for k 2 ~ { andy, = 1 =y 2 . 

Now let us examine the effect of k 2 on the 10-step estimate of the 
stationary points wheny, = 3=y 2 (Table 12-4). Again A' 2 = 0.50 results in 
the best 10-step estimate of the stationary points. The reason for this 
phenomenon might be thought to be that the initial conditions are an 
integer multiple of the step size 0.5 and the gradient is permitting the 
solution to fall precisely on the stationary points by accident. However, a 


Table 12-1 ( Continued ) 


Key Strokes 


Comment 

RCL-5 ' 


O'.-2) 

RCL-3 i 
X ( 

Compute 

k 2 (y\- 2) 

2 ( 



X 1 

- 

II 

<F 

2/c 2 (y, —2) 

CHS j 

f 

-2A: 2 (y,-2) = Ay 1 

RCL-1 

) 

y\ 

4- 

> Compute new y, 

y,+Ay, 

STO-1 

) 

y, + Ay,-*R-l 

RCL-2 i 


^2 

RCL-3 i 


k 2 

X ( 

Compute 

k iyi 


' - 3 L , 


x \ 

\ a 

2 k 2 y 2 

CHS 

) 

— lk 2 y 2 ~ by 2 

RCL-2 "I 

| 

yi 

4- 

> Compute new y 2 

y 2 + A_v 2 

STO-2 1 

( 

y 2 + Ay 2 -^R-2 

DSZ 


Number of steps = A? 

GTO-1 

\ 


If no, go to 1 

RCL-6 

| 

If yes, recall L 

R/S , 

J Display 

Display L 

RCL-1 ( 

T,yi,y 2 

Recall y x 

R/S / 

> 

Display y x 

RCL-2 


Recall y 2 

R/S 


Display y 2 

RTN j 

! 

Return to the top of the program 



Table 12-2 Tenth-Step Results for Various k 2 


ki 

^10 

■^1,0 

- V 2,o 

0.01 

1.39027066 

1.18292719 

0.81707281 

0.10 

0.03602880 

1.89262582 

0.10737418 

0.25 

0.00000763 

1.99902344 

0.00097656 

0.50 

0.00000000 

2.00000000 

0.00000000 

0.75 

0.00000763 

1.99902344 

0.00097656 

1.0 

2.00000000 

1.00000000 

1.00000000 

2.0 

7.748409780X 10 8 

- 5.9047 X 10 4 

5.9049 xl 0 4 


Table 12-3 Iterative Gradient Optimization 
with k 2 — 0.5 


Number of 
Iterations 

L 

.Vl 

.V2 

0 

2 

1 

1 

1 

0 

2 

0 

2 

0 

2 

0 


Table 12-4 Tenth-Step Results for Various k 2 and y x =y 2 = 3 


k 2 

( k) IQ 

(>’)io 

(>’2)10 

0.10 

0.18014399 

2.10737418 

0.32212255 

0.20 

0.00101560 

2.00604662 

0.01813985 

0.40 

0.00000000 

2.00000010 

0.00000031 

0.50 

0.00000000 

2.00000000 

0.00000000 

0.75 

0.00003815 

2.00097656 

0.00292969 

1.00 

10.00000000 

3.00000000 

3.00000000 
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test of this hypothesis is but a few key strokes away, in that we can try the 
initial conditions y, = 77 and y 2 = 2-77 with keyboard entry. Then for /c 2 = 0.5 
we find 

( L ) I0 = 0.00000000 
(t 1) 10 = 2.00000000 
(y 2 ) lo = 0.00000000 

where the sequence of estimates is as shown in Table 12-5. At this point it 
should be clear that k 2 = 0.5 is a unique value that causes the system of 
optimization equations to exhibit the peculiar property that the singular 
points are exactly determined on the first step of the iterative solution. This 
is precisely the case and it serves to illustrate the following key points: 

1. Almost every system of equations has unique numerical properties. 
The analyst must keep ever alert for their discovery. Often it is possible to 
capitalize on them. The pocket calculator is an ideal means for this kind of 
research and exploration. 

2. An understanding of the unique properties leads the analyst to a 
better understanding of the equations he is using. (It is left to the reader to 
determine why this simple system of equations has the property that the 
stationary points are exactly determined numerically when k 2 = j, no 
matter what the values ofyq andy 2 .) 

In what follows we do not use the unique value of K 2 that exactly 
determines the singular points. To further illustrate the gradient method, 
let us use A: = 0.2 and now consider the questions: How do we know we 
have reached the minimum L in a 10-iteration optimization? How can the 
analyst quickly gain confidence that the stationary point is not a local 
minimum of which there is a “deeper” minimum nearby? Mathematically 


Table 12-5 Iterative Gradient Optimization with 
k 2 = 0.5 and Irrational Initial Conditions 


Number of 
Iterations 

L 

y\ 

>2 

0 

40.74165139 

ir 

277 

1 

0 

2 

0 

2 

0 

2 

0 

3 

0 

2 

0 

4 

0 

2 

0 

5 

0 

2 

0 
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there is no guarantee that the gradient method will find THE global minimum 
of his function. Some practical things can be done, however, that give 
confidence in the end result of an optimization analysis, tieing all of these 
questions together and giving some plausible answers. Among them are the 
following: 

1. Stopping the process for a fixed number of steps. This is the simplest 
criterion for terminating the iterative stationary point search process. 

2. Selecting a new set of initial conditions and searching for the 
stationary point using the same number of steps. 

3. Continuing (1) and (2) until convergence from all quadrants around 
the stationary point (initial one found) has been established. 

4. Using the stationary point as the initial conditions and demonstrating 
stability of solution at the stationary point. 

This procedure is quick, and will usually uncover the areas of concern if 
the results are different from initial conditions. Also, because more 
“samples” are available from the stationary point selection process, we 
tend to have more statistical confidence that we have indeed found the 
stationary point. 

Returning to our optimization problem, we find the stationary point to 
be somewhat different when approached from different directions. Table 
12-6 illustrates this point. It is apparent that L is a minimum in the near 
neighborhood of 

Ti=2 

y 2 = 0 


Table 12-6 Ten-Step Gradient Optimization Results when 
Stationary Point Approached from Different Directions 


yi* 

-^2 0 

T\q 

CvOio 

(>' 2)10 

5 

4 

0.002539 

2.01813985 

0.02418647 

5 

5 

0.00345304 

2.01813985 

0.03023309 

4 

5 

0.00294524 

2.01209324 

0.03023309 

-4 

4 

0.00528112 

1.96372029 

0.02418647 

-5 

5 

0.00751544 

1.95767368 

0.03023309 

-5 

4 

0.00660140 

1.95767368 

0.02418647 

-5 

-4 

0.00660140 

1.95767368 

-0.02418647 

-5 

-5 

0.00751544 

1.95767368 

-0.03023309 

-4 

-5 

0.00619516 

1.96372029 

-0.03023309 

4 

-4 

0.0060203120 

2.01209324 

-0.02418647 

5 

-5 

0.00345304 

2.01813985 

-0.03023309 

4 

-5 

0.00294524 

2.01209324 

-0.03023309 
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The 100-step iteration bears this out, resulting in 


L^O 


y, = 1.98790677 
y 2 = 0.00000000 


Example 12-3. The penalty function method Let us consider the problem 
of minimizing the function 

L = Oj-2) 2 +yf 

subject to the constraint 

>’i=0 

This problem, as mentioned before, is trivial from a practical viewpoint but 
is instructive in regard to pocket calculator optimization. Using the 
Courant penalty function method, we form the auxiliary function 

^=(Ti-2) 2 +T2 + ^iT? 

Our objective is to minimize this new function, using the programmable 
pocket calculator. For this example we employ the gradient method of 
optimization to merge the learning of both methods. Here 

a! + 

* )^=2, 2 

' T2 

Then ^ 

( — = — 2[( 1 + 2£,)y. — 2]& 2 = Av, 

Ay = < ^ 

l ~k 2 ^ = -2k 2 y 2 ^Ay 2 

and 

y=y + Ay=l y ' + ^ y ' 

I T 2 + 4v 2 

Programming the pocket calculator typically involves the sequence of 
key strokes shown in Table 12-7. 



Table 12-7 Gradient Optimization with Equality Constraint 


Key Stroke 

Sequence Comment 


LBL-A 


Set program step counter and 
pointer to begin at the first 
place in memory 

R/S v 


Stop for data input 

SRO-1 


Storey, in register 1 (R-l) 

R/S 

STO-2 i 


Storey 2 m R-2 

R/S ( 

STO-3 / 

> Input 
data 

Store k x in R-3 

R/S 

STO-4 ' 

Store k 2 in R-4 

1 

R/S 

STO-8 j 

1 

Store N in R-8 

LBL-1 


Label this step “1” 

RCL-3 \ 


Recall R-3 (&,) 

1 1 
+ 


Add one to k x 

RCL-1 

r 1 


Recall y 1 

v / 


Square y. 

x [ 


y? X(1 + k x ) 

RCL-1 \ 
4 , 

^ Compute 

Recall y, 

X I 

L 

4y, 

l 


i 

r 

RCL-2 1 

r x 


Recall y 2 



y\ 

+ 


(\ + k A )y 2 x -4y x +yj 

4 



+ 1 

1 

(l + /c 1 )yf-4y 1 +yf + 4 = L 

R/S 


Display (L) 

STO-6 \ 


Store L in R-6 

RCL-3 

2 


Recall k x 

X 

1 1 


2k x 
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Key Stroke 

Sequence 

Comment 

4- j 

r 

(1 + 2 *,) 

RCL-1 | 

i Compute 

y i 

X 

2 / 

' *-■’ 3,^ 

y,(l + 2/c,) 

- 1 


(1 + 2k x )y x —2 

RCL-4 


k 2 

X 


k 2 [(\ + 2k x ) yi ~2) 

2 



X j 


2[(1 + 2k,)y, — 2]/c 2 

CHS / 


— 2[(1 +2k ] )y l — 2\k 2 = AY , 

R/S > 


Display A F, 

RCL-1 i 

l 

Fi 

+ \ 

i Compute 

V: 

+ 

II 

R/S | 

| new y, 

Display y, 

STO-1 ) 


Storey, in R-l 

RCL-2 \ 


Recall y 2 

RCL-4 1 


Recall k 2 

X l 

Compute 

k 2 y 2 

2 ) 

J-ar . 


X 

N> 

II 


CHS 

1 

~2k 2 y 2 = Ay 2 

R/S 


Display Ay 2 

RCL-2 

Compute 

Recall y 2 

+ 

y 2 + Ay 2 =y 2 

R/s ( 

new y 2 

Display (y 2 + Ay 2 ) 

STO-2 
DSZ S 

Return to top of 

Storey 2 in R-2 

GTO-1 

program for iteration 


RCL-6 ] 

1 Display L 


R/S j 

RCL-1 j 

1 Display yi 


R/S j 

RCL-2 j 

1 

> Display y 2 


R/S \ 



RTN ; 
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Let us take the example from Section 12-6 (k x ~35). Our first task is to 
find a value of k 2 that will ensure stability in the gradient search process. 
We see from Table 12-8 that k 2 = 0.01 can provide stable iterations at 
k j = 35 and is reasonably large to permit quick convergence (Table 12-8). 


Table 12-8 Gradient Optimization Results for Various k x 



k\ 

k 2 

(L)io 

(>’ 1)10 

(^ 2)10 

Stable 

0.0001 

35 

29.51091749 

0.87049244 

0.99800180 


0.001 

35 

6.55662021 

0.23828719 

0.98017904 


0.01 

35 

4.61180996 

0.028335500 

0.81707281 

Unstable 

0.05 

35 

4.649658075 X 10 15 

6.932487456 X 10 7 

0.34867844 


0.10 

35 

5.032858806 X 10 21 

1.560738178X10 11 

0.10737418 


Using k 2 = 0.01, k { = 35, and y l = \=y 2 , we find after 100 steps that 

(L) 100 = 3.93420285 
(t i) ioo = 0-02816901 
(l 2 ) 100 = 0-13261956 


and after 200 steps that 


(L) 200 = 3.91621180 
(>- 1 ) 2OO =0.02816907 
(>' 2)200 = 0-01758795 

An approach that involves 200 steps but gives more confidence that the 
stationary point is in the neighborhood of the estimate made with the 
program is to make five 40-step searches for the stationary point and, as 
before, begin the searches from different quadrants as well as the 
“average” solution point. The value of k Y is then selected so that in 40 
steps the search will cover the region of the expected solution. In our case, 
we expect a solution in the neighborhood of (0,0), so that 40 steps of 0.05 
will cover the region from — 1 to +1. Then the task is to find a reasonably 
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high value of k 2 that will permit stable iterations. We see from Table 12-9 
that a /c 2 — 7.5 results in stable searches for the singular point. Then four 
40-step searches provide the results given in Table 12-10. This approach 
results in more confidence that the stationary point for 

L(y\,yi) 

subject to the constraint y x = 0 is located at 

y j = 0 (by the constraint) 

y 2 = 0 (by analysis) 


Table 12-9 Gradient Optimization Results for Various k 2 



Number of 
Iterations 

k 2 

k x 

(T)io 

(jP 1)10 

(^ 2)10 


( 10 

1 

0.05 

2.35474379 

0.67608251 

0.34867844 

Qt Q Kip 

< 10 

2.5 

0.05 

3.20535903 

0.33340324 

0.34867844 

kJ lu U1C 

) 10 

5.0 

0.05 

3.62116902 

0.18181818 

0.34867844 


’ 10 

7.5 

0.05 

3.80010179 

0.13029079 

0.34867844 

T I n c t K1 p 

1 10 

10.0 

0.05 

57.99696974 

2.44195747 

0.34867844 

U Ha Id LMC 

i io 

12.5 

0.05 

54447.09556 

101.5703041 

0.34867844 


Table 12-10 Gradient Optimization using Penalty Function Method 
for Satisfying Equality Constraint—Average Result Technique 


Initial Values 


y 1 

y 2 

(k ) 40 

(y 1)40 

(^2)40 

1 

1 

3.63308223 

0.12500000 

0.01478088 

- 1 

1 

3.63308222 

0.12500000 

0.01478088 

- 1 

-1 

3.63308222 

0.12500000 

-0.01478088 

+ 1 

-1 

3.63308223 

0.12500000 

-0.01478088 

Average of all estimates of y x andy 2 0.12500000 

0.00000000 
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or 


y, =0.125 
^2 = 0 


(both by the analysis above) 


We can make a test of the expected answer to complete the 200-step 
process as follows: 


yj =0.125 


y 2 = 0.0 

Then 


(L) 40 = 3.63281250 
(t , 1 ) 40 = 0* 12500000 
(^ 2)40 = 0.00000000 

It is apparent that the stationary point of L(y l ,y 2 ) is in the neigh¬ 
borhood of 

! y, = 0.125, y, = 0.0 ) 

V Stationary point for L 
n y 2 = 0.0, y 2 = 0.0 j 

Of the two approaches to penalty function searches on the pocket 
calculator, the latter is recommended because more information on the 
local topology of L is used to generate the estimate of the values of y, and 
y 2 at the point where L (and thereby L) is a minimum. 


12-7 REFERENCES 

For this chapter consult A. E. Bryson and Y. C. Ho’s excellent book, 
Applied Optimal Control (Blaisdell Publishing Company, 1969), Chapter 1. 
The examples used in this chapter were first presented by Bryson and Ho 
at their outstanding seminar on Applied Optimal Control. 



APPENDIX 1 


SOME TRICKS 
OF THE POCKET 
CALCULATOR TRADE 


In the course of writing this book, a number of interesting “special 
methods” were offered by many colleagues. Unfortunately, the list is far 
longer than might be conveniently included in a single chapter. This 
appendix presents some of these methods, selected on the basis of their 
usefulness in pocket calculator analysis or because they are novel and 
interesting. 


AM 7T AND e ON THE FOUR-FUNCTION CALCULATOR 

An easy-to-remember sequence of numbers, which will generate it with an 
error of only 4x 10 -7 , is 


11 33 55 


We see that this set of numbers is made up of double entries of the first 
three odd digits of the positive numbers. Then 


where 


, _ 355 
”’ - TT3 


= 7T+ € 


c < 4X 10~ 7 
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Some Tricks of the Pocket Calculator Trade 


The key stroke sequence for generating n t on the four-function calculator 
is 


(355) 

(113) 


3.1415929 


A similar ratio generating an approximation of the number e that was 
published by Texas Instruments Corporation in their applications guide is 

193 

—- =2.7183098 

This ratio is not easily remembered, except perhaps by noticing that each 
digit is an odd number and that the digits appear in the sequence 

1 3 5 7 9 1 3 5 

Sequence 1 3 

of occurence N ' Ss \^ 

with the first three in the numerator and the last two in the denominator. 
The result of the ratio is accurate only to the fourth digit (i.e., 2.718), which 
is one digit fewer than must be remembered in the ratio (i.e., 1, 9, 3, 7, and 
1). One might as well memorize e to five places: 

e = 2.71828(1828- • •) 

The author devised a simpler, more accurate, and more easily remem¬ 
bered sequence of zeros and odd numbers (as used in the n sequence): 

00 11 33 55 77 99 

The procedure is as follows: 

1. Cancel the 77 and 11 pairs (symmetric operation). 

2. Put a decimal place after the first zero and a parentheses before the 
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last 9 (symmetric operation) to obtain 

(0.0 33 55 9)9 

This product is e/9 to six digits when rounded at the sixth digit. Thus 
e = (0.0 33 55 9)9 X 9| rounded 

In evaluating e in this manner, the key stroke sequence is 

(0.0 33 55 9) 

X 

(9) 


2.718279 


2.71828 when rounded 


The relative error in this evaluation of e is less than 7x 10“ 5 %. 

To discover such sequences, the calculator becomes a research tool. 
When trying to find interesting ways to generate approximations to often- 
used numbers, one can begin by repeated calculator operations on a 
number and look for an interesting pattern. For example, four divisions of 
77 by 6 on an eight-digit calculator will result in the number 


0.002424 

which is, curiously enough, 


Thus 


6x4x 10 _4 -f6x4x 10~ 6 


7ts6 4 (6x4x 10~ 4 + 6x4x 10 6 ) = 3.141504 
for a relative error of 


0,0000886 

03.1415926 


X 100 = 0.00282% 


Note that only the two even numbers, 4 and 6, are used in this evaluation 
of 77. This particular approximation was worked out at the time of this 
writing as an illustration of the interesting properties of number approxi¬ 
mations that can be found with the aid of a simple eight-digit calculator. 
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A1-2 TRUNCATING A NUMBER ON A CALCULATOR THAT DOES NOT 
HAVE EXPONENTIAL NOTATION 

Truncating a number is the process of reducing a number made up of an 
integer and fractional parts to an integer. For example, when the numbers 

1.21743 

247.41715 

5764.88177 

are truncated, they result in the numbers 

1.0 

247.0 

5764.0 

Notice that there is no rounding. An interesting approach to truncating 
any number in the registers of a pocket calculator that does not have 
scientific notation is the following: 

1. Divide the number by 1000- * •, where the number of zeros fills the 
rest of the display register. 

2. Multiply the result of step (1) by the divisor in step (1)—1000 * • • . 

The result is the truncated number being sought. The key stroke sequence 
is 

(number in display register) 


( 100000 -••) 

X 

( 10000 ---) 


truncated number in 
display register 


Al-3 LUKASIEWIC’S ALGORITHM FOR EVALUATING ANY FUNCTION 
ON A MACHINE WITH REVERSE-POLISH NOTATION PLUS AN 
OPERATIONAL STACK 

Step l Write function in serial form. 

Step 2 Key in first number. 
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Step 3 Compute all functions of the single number and enter them in the 
stack (keyboard operations such as ln(x), 10*, and sin(x). 

Step 4 Compute all 2 number functions and enter them in the stack 
(keyboard operations such as +, —, X, , x y , and xy,). 

Step 5 Key in next number, then repeat steps 3 through 5 until the 
function is evaluated. 

This algorithm is flow-charted as shown in Figure A-l. Clearly, this 
algorithm requires an infinite number of registers in the stack to evaluate 
any function. The lower limit is two registers and a reasonable size is thiee 
for most commonly encountered scientific functions. Hewlett-Packard’s 
HP-35, HP-45, and HP-65 calculators (the most popular Reverse-Polish 
plus stacks machines) all have four registers in their operational stack. 



Figure Al-1 An algorithm for function evaluation on the Reverse-Polish plus stacks 
machines. 
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Example. Consider the expression 

-b 5)[ln(C + Z)) 1/2 + 

This complex expression can be evaluated according to the algorithm with 
the following key strokes on the HP-35 (circled number correlates evalua¬ 
tion to the algorithm flow chart operation): 

key stroke sequence 


(A) 

® 

T 


(B) 

® 

+ 

© 

T 

© 

(C) 

© 

T 

© 

(D) 


+ 

© 

V 

© 

In 

© 

(E) 

© 

+ 

© 

X 

© 


Result in 


Display Register 


Al-4 QUICK POLYNOMIAL APPROXIMATIONS FOR ANALYTIC 
SUBSTITUTION 

A simple, not too accurate, but fast, procedure for developing polynomials 
that can be used to approximate functions is the following: 

1. Identify a number of rational number conditions, x v x 2 ,...,x 3 , under 
which the function takes on rational number values, c,, c 2 , c n . 

2. Prepare a polynomial with coefficients a v a 2 ,...,a n ) which is evaluated 
using simultaneous equations. 
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Example. Approximate sin# with a second-order polynomial on the in¬ 
terval 0< # < 90°. 

# 

Degrees Sin# 

0 0 

30 i 

90 1 

We use a quadratic equation sin#«a, + tf 2 # +# 3 # 2 to approximate sin# 
on the interval 0-90°. The coefficients are determined using the simul¬ 
taneous equations 

0 = U] + <? 2 (0) + <73(0) 
j = a l + a 2 (30) + # 3 (30) 2 
1 = a x + a 2 (9 0) + a 3 (90) 2 

By inspection we see that 

a x = 0 

This system of equations reduces to 

i = tf 2 (30) + apO) 2 
1 = <3 2 (90) + <3 3 (90) 2 

-| = « 2 (90)-3^ 3 (30) 2 
l = « 2 (90) + a 3 (90) 2 

-i = ^[(90) 2 -3(30) 2 ] 

-i -1 -1 

81 X 10 2 —27 X 10 2 54 X 10 2 108X 10 2 

-1 
10800 


Summing we find: 



a 3 = 

.\a 3 = 
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We can now use a 3 to compute a 2 by way of one of the simultaneous 
equations: 


i-* 2 ( 30)- 


900a: 

10800 


, 1/1 900 \ _ 5400 + 900 _ 6300 = 63 

2 30 \ 2 10800 j 324000 324000 3240 


The approximating polynomial then is 


sin# = 

for 0 < 9 < 90°, when 

The characteristics 

Table Al-1 

63 9 0 2 

- + c (0 in degrees) 

0)< 11.14% 

mation are seen given in Table Al-1. 

" 3240 10800 

100(e/sin 

of this approxi 






Maclaurin 





Relative 

Expansion 

B 

( 

63 9 B 2 \ 


Error 

(e- e 4\ 

(degrees) 

sin 0 y 

3240 10800 / 


Percent 


0 

.0 

.0 

lim 1 

f estsi " 9 )- 11.41 





*0 < 

( sm0 / 


5 

0.08715574 

0.09490741 


-8.89 

0.08715570 

10 

0.17364818 

0.18518519 


-6.64 

0.17364683 

15 

0.25881905 

0.27083333 


-4.64 

0.25880881 

20 

0.34202015 

0.35185185 


-2.87 

0.34197708 

25 

0.42261826 

0.42824074 


-1.33 

0.42248706 

30 

0.5 

0.5 


0.00 

0.49967418 

35 

0.57357644 

0.56712963 


+ 1.24 

0.57287387 

40 

0.64278761 

0.62962963 


+ 2.05 

0.64142155 

45 

0.70710678 

0.68750000 


+ 2.77 

0.70465265 

55 

0.76604444 

0.74074074 


+ 3.30 

0.81250684 

60 

0.86602540 

0.83333337 


+ 3.77 

0.85580078 

65 

0.90630779 

0.87268519 


+ 3.71 

0.89111986 

70 

0.93969262 

0.90740741 


+ 3.44 

0.91779950 

75 

0.96592583 

0.93750000 


+ 2.94 

0.93517512 

80 

0.98480775 

0.96396296 


+ 2.22 

0.94258217 

85 

0.99619470 

0.98379630 


+ 1.24 

0.93935606 

90 

1.0 

1.0 


0.0 

0.92483223 
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The table showing a relative error as large as 11.41% should be enough to 
point up the inaccuracy associated with this method. It is worth mention¬ 
ing, however, because the method can be useful for quick curve getting 
through experimental data or measurements that are known only to a few 
percent. 

Al-5 A METHOD FOR COMPUTING RECIPROCALS ON THE FOUR- 
FUNCTION CALCULATOR 

A straightforward, but often overlooked (even by manufacturers in their 
applications manuals) technique for evaluating the reciprocal of a number 
is to enter the number into the display and constant registers (usually done 
automatically for the constant register), divide the number by itself to 
enter 1 in the display register, and then divide again to find the reciprocal. 
The key stroke sequence is shown in Table A1-2. 

Table Al-2 


Automatic 

Constant Manual Constant 

Register Entry Switch Activation 


CO -^Constant key—ObT) 


1 /A in display register 


Al-6 ALPHA-NUMERICS ON THE POCKET CALCULATOR 

Calculators are, by definition, capable solely of numerical manipulations 
and displays. An interesting aspect of our arabic-based alphabet and 
number system is that they have many common symbol shapes. Because of 
this there are words that can be spelled out using numbers. For example, in 
most calculator displays 


BDE5 =8055 

Even more interesting is the fact that some letters are made up of 
upside-down numbers. A typical example is 

3 upside down = E 
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The numbers displayed in the calculator display window that correspond 
to letters are shown in Table A1-3. 


Table A1-3 


Numeric 

Display 

Window 

Corresponding 
Right-side up 
Letters 

Corresponding 

Upsidedown 

Letters 

n 

I | 

0 

0 

l 

1 

“I 

I or l 

I or / 

1J 

“I 

j 


E 

U 

l 

y 

h 

5 

s 

S 

|— 

111 

G 


— i 

I 


L 

ri 

u 

B 

B 

Ei 


G 

Total 

1 

9 


Note that there are three vowels among the upside-down letters but only 
two among the right-side-up letters. Also, there is one more consonant 
among the upside-down letters than among the right-side up letters. The 
author conjectures that it is for this reason that the upside-down set is 
more popular with pocket calculator innovators. 

Examples of the better known pocket calculator “scrabble” words are 
the following (all to be viewed upside down): 


Greeting 
Object 
Proper name 
Adjective 
Expletive 


07734 

38079 

318808 

35007 

57738.57734 X 10 40 
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MATRIX ANALYSIS 

ON THE POCKET CALCULATOR 


Matrix manipulations on the pocket calculator are fairly straightforward 
compared with matrix calculations for general-purpose computing. The 
matrices that the pocket calculator, and even the programmable pocket 
calculator, can operate on are small, hence can be easily manipulated 
manually if problems of ill-conditioned matrices are encountered. Here we 
concern ourselves with basic matrix operations for 2 X 2 and 3x3 matrices. 

The most fundamental matrix operations are those of addition, subtrac¬ 
tion, and multiplication of two matrices. Consider the two matrices 


A = 

a \ 

a 2 

B = 


*2 


a 3 

a 4 


*3 

*4 


The sum of these two matrices is then given by 


A + B = 


a x + b\ 
a 3 +b 3 


a 2 + b 2 
a 4 + b 4 


and the difference by 


A B = 


a \~ b \ 

a 3 -b 3 
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and the product by 

ci\b l + a 2 b 3 a 2 b 4 

a 3 b x + a 4 b 3 a 3 b 2 + a 4 b 4 

The inverse of the 2x2 matrix A is given by 



where 

a x = a 4 /dct 
a 2 = a 3 /det 
a 3 = a 2 f det 
a 4 = a 1 /det 

The determinate can be numerically evaluated as 

det A == ci i@ 4 fl')@ 3 

In a similar fashion, the 3x3 matrix operations can be defined for the 
sum, difference, and product. However, the inverse of a 3x3 matrix is 
defined somewhat differently. Consider now the matrix A defined by 



which has the inverse 
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where the alphas can be numerically evaluated with the equations 

a \~ 2 C 3 ~ ^ 3 ^ 2 )/ det 

a 2 = ( a 3 c 2 — a 2 c 3 )/det 
a 3 = (a 2 b 3 -a 3 b 2 )/det 
«4 = (Vi-Vs)/^ 

a 5~( a i c 3~ a 3Ci)/ det 
a 6 = ( a 3 b l~ a l b 3)/ det 

(x 7 == (b y c *2 £> 2 ^ 1 ) j det 

« 8 =( a 2 c l-^l c 2 )/ det 

a 9 = (a x b 2 — a 2 b x )/det 

Here the determinate can be numerically evaluated with the equation 

det = a x b 2 c 3 + a 2 b 3 c x + a 3 b x c 2 — a 3 b 2 c x — a 2 b x c 3 — a x b 3 c 2 

Clearly the matrix inversion will work only if the determinate is nonzero. 

Another important matrix manipulation that is frequently encountered 
and can be easily evaluated on the pocket calculator is the determination 
of the characteristic equation for the matrix A . That is, 

A — XI = — A 3 + d x X 2 + d 2 X + d 3 = 0 

Here d x through d 3 can be given by the equation 

d x — a x + b 2 + c 3 

d 2 = d 3 c x + a 2 b x + b 3 c 2 — a x b 2 — a x c 3 — b 2 c 3 
d 3 = det = a x b 2 c 3 + a 2 b 3 c x + a 3 b x c 2 — a 3 b 2 c x 
-a 2 b l c,-a l b 3 c 2 

Using these equations, the matrix manipulations associated with many 
vector-matrix operations can be determined for second- and third-order 
matrices. Beyond the second- and third-order matrix analysis, the evalua¬ 
tion on the pocket calculator, though possible, becomes somewhat tedious. 



APPENDIX 3 


COMPLEX NUMBERS 
AND FUNCTIONS 


While it is not the purpose of this book to teach, complex variable theory, 
it will be discussed to review the concepts in complex numbers and 
functions that are pertinent to the evaluation of advanced mathematical 
functions. Complex numbers written in Cartesian form are as follows: 

z = x + iy 

Complex numbers in Cartesian form can be written in polar form: 
z = re 10 — r(cos9 + isinO) 

It is apparent that the modulus of the complex number is 

\z\ = (x 2 +y 2 ) l/2 = r 

Similarly, the argument of a complex number is given by 

arg(z) = tan -1 ^j = 0 

It is common to refer to the real and imaginary parts of a complex 
number: 

Re(z) = R (z) = x = r cos 9 — real part 
Im(z)= I(z)=y = r sin 9 — imaginary part 
The complex conjugate of a complex variable c is given by 

z = x-iy = z c 
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It is clear from the definition of the modulus of a complex number that 

Similarly the equation for the argument of a complex number allows 

ar g( z c) ” — ar §( z ) 

It is worth remembering that the complex conjugate is used in clearing the 
complex form of the denominator of a complex number. 

The multiplication and division of two complex numbers 

z i = jc i -I- /y i and z 2 =x 2 + iy 2 


are given by 


^z 2 = x x x 2 -y x y 2 + i{x x y 2 + x 2 y x ) 

and 


z i Z 1 Z 2 C x l x 2 +y i y 2 + i(x 2 y l -x l y 2 ) 


z 2 \z 2 \ 

It is apparent that 

Similarly we see that 


4 +yl 


I Z 1 Z 21 = I Z 1II Z 2l 


arg(zjz 2 ) = argzj + argz 2 

Ifil 

l z 2 l 

ar8 ^j= ars ( Zi )- ar§ ( z 2 ) 

Powers of complex numbers can be written in polar form: 

which is equivalent to 


z n = r n cosn0 + ir n sinn0, (n = 0, ± 1, ±2,...) 



338 


Complex Numbers and Functions 


In particular, 

z 2 = x 2 —y 2 + i{2xy) 
z 3 = x 3 — 3 xy 2 + i(3x 2 y ->’ 3 ) 
z 4 = jc 4 - 6 xY+y 4 + i(4x 3 y - 4xy 3 ) 
z 5 = X s — \<dx 3 y 2 + 5xy 4 + i(5x 4 y— 10x^ 3 +j> 5 ) 
In general, we can write 



Furthermore, if the nth power of z is written in the form 

z n = u„ + iv„ 

n n 

then 

^ + 1 = ^ + i + iv n+l 

where 

U n+\ = XU n~yVn 
Vn+l= X Vn+y“n 
For negative powers of a complex number, 

I = A. = x ~ iy 

z |zp x 2 +y 2 

and more generally 
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If data are greater than — 77 but smaller than or equal to + 77, equation 
computes the principal root. The other root has the opposite sign, of 
course. The principal root is always given by 

z l/2 = [\(r+ x)^ /2 ±i[^(r- x)] X/1 = u ± iv 

where 2 uv=y and the ambiguous sign is taken to be the same as the sign 
of y. 

In general, then, the nth root of the complex number z is given in polar 
form by 

z l/« _ r \/n e i9/n 

Again, this equation computes the principal root if B is greater than —77, 
but smaller than or equal to +77. The other roots are computed from the 
expression 




APPENDIX 4 


KEY STROKE SEQUENCES FOR 
COMPLEX VARIABLE ANALYSIS 
AND HYPERBOLIC, INVERSE 
HYPERBOLIC FUNCTIONS 


The Hewlett-Packard Company has graciously given permission to publish 
certain pages from their HP-35 MATH PAC that will help the reader 
perform complex variable analysis on the pocket calculator. Key stroke 
sequences for commonly encountered complex variable analysis and 
certain hyperbolic functions are given. They are useful for analysis on 
the HP-35. They are also useful for showing the details of the 
mathematics required to do the analysis on any calculator. The 
Hewlett-Packard corporation has a superb pocket calculator product 
support organization that publishes handbooks of keystroke sequences for 
solving problems in many fields (business, real-estate, science, etc.). The 
HP-35 MATH PAC handbook of keystroke sequences can be obtained 
from any Hewlett-Packard representative or from the Hewlett-Packard 
Company, Advanced Products Division, 10900 Wolfe Road, Cupertino, 
California 95014. 
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Complex Number Operations 


Complex add 

Formula: 

(aj +ib 1 ) + (a 2 + ib 2 ) = (a, + a 2 ) + i(bj + b 2 ) 
= u + iv 


Example: 

(3 + 4i) + (7.4 - 5.6i) = 10.4- 1.6i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

a, 

UDcucnan 



ES 

mam 

mmmmm 

u 


D 





4 

— 

b 2 

□□□□□ 

V 



Complex subtract 

Formula: 

(aj +ibj)-(a 2 +ib 2 ) = (a! - a 2 ) + i(b x -b 2 ) 
= u + iv 

Example: 


(3 + 4Q - (7.4 - 5.60 = -4.4 + 9.6i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

l 


mtziann 



2 


□ □□□□ 

U 


3 


ISIHHIHM 



4 

b 2 

laHHIHHI 

V 



Complex multiply 

(Use for short data input) 

Formula: 

(aj + ibj) (a 2 + ib 2 ) = (a 2 a 2 - bjb 2 ) + i(a,b 2 + a 2 bj) 


= u + iv 
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Example: 


(3 + 4i) (7 - 2i) = 29 + 22i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b, 

□□□□□□ 



2 





3 

a 2 

mmmmm 



4 

b 2 

lEniEMEnisnini 



5 


□ □□□□ 

U 


6 

a 2 

lanairairairai 



7 

b 2 

BBliSBMI 

V 



Complex multiply, alternate method 

(Data are entered only once, use for long data input) 

Method: 

If a t , a 2 , bj, b 2 are all non-zero, and if 6 = + d 2 =£90° or-90°, then 

(a! + it^) (a 2 + ib 2 ) = —^(cos Q + i sin 9) = u + iv. 
sin 61 • sin 0 2 

Note: aj + ib x = ^e 1 ® 1 , 
a 2 + ib 2 = r 2 e 102 
Example: 


(3 + 4i)(5- I2i) - 63 - 16i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

a, 

□□□□□□ 



2 

b, 

IS33IOIBIB9II3II 



3 

3l 




4 

b 2 

rn mu [*3 mu mu 



5 


| tan | |RCL | fiTI | Ri 1 1 Rl | 



6 


IflkszaiEiiinlEal 



7 



D 

If D = 90 or -90, stop. 



□□□□□ 


Use other method. 

8 


HamaH 



9 


loiDurtaiDiiQaai 



10 


mu mu cm mu mu 



11 


IE1IHIIIS1IH1 

u 


12 

-i 



V 
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Complex divide 

(Use for short data input) 
Formula: 


(a! +ibi) t( a 2 +ib 2 ) - 


(a^ + b x b2) + i(a 2 b! ~a 1 b2) 
a 2 2 + b 2 2 


= u + iv 


Example: 


3 + 4i 
7-2i 


.245 + .64i 


UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b, 

[!]□□□□ 



2 


cdcdocd^ 



3 

a 3 




4 

bj 

(^CrT][rT][sto][T] 



5 


□□□□□ 



6 

8j 

|RCL|| X 11 Rl II R* 11 fn 1 



7 

bj 

cznzi^jna 



8 


CD [!]□□□ 



9 

bj 

CD CD CD IfDGiD 



10 



U 


11 


n^irnrni—irn 

V 



Complex divide, alternate method 

(Data are entered only once; use for long data input) 

Formula: 

If a!, a 2 , bj, b 2 are all non-zero, and if 6 = - 0 2 =£ 90° or -90°, then 

(a t + ib x ) -r (a 2 + ib 2 ) = (] (cos 6 + i sin 0) = u + iv 
V b 2 / sin 02/ 

Note: aj + ib x = ^e 1 ® 1 , 

a 2 ib 2 = r 2 e i02 . 

Example: 

(63- 16i)-K5- 12i) = 3 + 4i 
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UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

*i 

□ □□□□ 



2 

b, 




3 

a 2 

[!]□□□□ 



4 

b. 

CD Dll) S] EH B 



5 


1 tan | [rCL| HrI] | jjl 11 R* 1 



6 


m isToi nii] m m 



7 


□□□□□ 

D 

If D = 90 or -90, stop. 



□□□□□ 


Use other method. 

8 


|rCl|| Ri]|sTo|| R* || Sin 1 



9 


gBHHH 



10 


ismmmH 



11 


Eiii@nn 

u 


12 


SOOZO 

V 



Complex reciprocal 


Formula: 


Example: 


1 

a + ib 


a - ib 

a^TF 


= u + iv 


1 

2 + 3i 


= .15 - .23i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b 

□D □ □ □ □ 



2 

a 

rn rn rn fin m 



3 


Li]L"*JI 



4 


finrnrnrnrn 



5 



u 


6 



V 
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Complex absolute value 

Formula: 


ja + ib| = Va 2 + b 2 


Example: 


13 + 4i| = 5 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

a 

BSdnpi 



2 

b 

mmrnf^ii i 




Complex square 

Formula: 


(a + ib) 2 = (a 2 - b 2 ) + i(2ab) = u + iv 


Example: 


(7 - 2i) 2 = 45 - 28i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

a 

QuoiiQnn 



2 ! 

b 

igmBirnrCTBB! 

U 


3 


[asq nn rr~i i~t~i 

v 



Complex square root 


Formula: 


Va + ib = ± 


Example: 



±(u + iv) 


V7 + 6i = ±(2.85 + 1.05i) 
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OPERATIONS 


H DDDD 

r^im im i ii i 

□ □□□□ 
mmnwm 
HHBBE 
CD EZ] □ □ I I 
CD [HD CD DID □ 
□ □□□□ 
jncT] HhsI r^l P5711 | 

B □ CD □ CD 
CD CD CD CD □ 
racii ii ii i 

□□□□□ 


REMARKS 


Complex natural logarithm (base e) 


Formula: 


Example: 


In (a + ib) = In (va 2 + b 2 ) + i 


= u + iv 


In i = 1.57i 


OPERATIONS 


□Dim CD CD C D 

□ □□□□ 

□ □□CD CD 

[3Lx][iix] |chs1CD CD 

jgDD 

□ □□□ 
!□□□□□ 


DISPLAY 


l*^y| 1 r CL| I -- 1| arc | | tan j 
□ □"□□□ 
□ □□□□ 
□ □□CD CD 
!□□□□□ 


If a # 0, go to 4 


If a> 0, go to 9 
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Complex exponential 

Formula: 

e ( a+fl>) = e a ( cos Q + i sin d) = u + iv 

where 

7T 

Example: 

e 1.570796327i _ ^ 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b 

1 t ||_||_||_||_| 



2 

a 

pi fsToi f^n rn m 



3 





4 


lEaraiDHHi 

U 


5 


l^][5r|(ncr][T](U 

V 



Complex exponential (t a + ib ) 

Formula: 


t a + ib _ e (a+ib)lnt = u + j v 


Restriction: 

(t>0) 


Example: 


2 3+4i = -7.46 + 2.89i 


UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b 

mm 

II II 1 



2 

a 

m i i 

□□□ 



3 

t 


x ||xA||RCL| 



4 


IdIdi 

STO | | x«£y | | 1 | 



5 



unum 



6 


rn b 


u 


7 


1 x;± V 11 » ir> 1 

"53LiJL_l 

V 
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Complex number to integral power 

Formula: 

(a + ib) n = r n (cos n0 + i sin n0) = u + iv 
where: r = Va 2 + b 2 , 

n b 

0 - arc tan and 
a 

n is a positive integer. 

Example: 

(3 + 4.5i) 5 = 926.44-4533.47i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

a 


□□□□□□ 



H 

a 

□ □□□□ 


If a * 0, go to 4 


□ □□□□ 



3 


|CLX| |EEx| [CHS| I 9 | | 9 | 



4 


[sToirsnrrimnn 



5 





6 


1 x5£ y| I RCL I 1 * || arc || tan I 


If a> 0, go to 8 



□ □□□□ 



7 





8 


n^irnrnnm 



9 

n 

i 

i 

i 

9 

B 



10 


ED [RCL] f^l fsTol fclxl 



11 


m i *~i m r^i ircli 





£□□□□□ 

u 




[x^J (_sinj [RCl] |_xj | _ | 

V 
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Integral roots of complex number 


Formula: 


i_ i 

(a + ib) n = r n 


0 + 360k . . 0 + 360k 

cos -+ i sin- 


= u k + iv k 

where: n is a positive integer, and 
k = 0, 1, n - 1. 


Example: 

5 + 3i has three cube roots: 
u 0 + ivo = 1.77 + .32i 
u x + iv x = -1.16 + 1.37i 
u 2 + iv 2 = -.61 - 1.69i 




OPERATIONS 

DISPLAY 

REMARKS 

1 

b 

IdBBUhh 



2 

a 

i 

i 

i 

i 

i 


1 f a ^ 0, go to 4 






ill 





4 





5 


[^mrnrnfTTi 



6 




If a > 0, go to 8 



i ii ii ii ii i 



7 


|l||8||0||+|| | 



8 





9 

n 




10 





11 





o 



Uo 


ra 


eshehcbsidIhi 

Vo 


14 


EBIEUMHHI 


Perform lines 14-18 for k = 1, 





2.n-1 

15 


mmmmm 



16 

n 

mmmmim 



17 


StZICZOa 

Uk 




sisifiDczin 

Vk 



349 





































Complex number to a complex power 

Formula: 

If aj + ibj ^ 0, 

( &1 +ib 1 ) (a2+ib 2 ) = e (a 2 +ib 2) ,n ( a i +ib i) = u + iv 

Example: 

(1 + i) (2 - i) = 1.49 + 4.131 




















Complex root of a complex number 

Formula: 

If + ib x =£ 0 
1 

( &1 +ib 1 ) a 2 +ib 2 = e [ln(a 1 +ib 1 ))/(a 2 +ib 2 ) 
= u + iv 

Example: 

Find the (2 - i) th root of 1.49 + 4.126i. 
Answer: 1 + i 


UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b, 

m 

□ □ 

□m 



2 

a, 

mmmmm 


If a, 0, go to 4 

■ 


nnniMai 

HD 


n 


iiGQisaiOHiniai 

■ 


4 


isEiiisnaiaiai 

II 


5 


mrnmmnAi 



6 



CS1IES9I 

Bicai 



7 




If a, >0, go to 9 



1 1 

mm 

JU 



8 


1 « 1 

mmmm 



9 


hJ 

mem 

mm 





|a|E3|0|0|W| 

|| | 


11 

a i 

1*1 

□ 

□ 

li i 



12 

bj 

|r; | 

ZD (ZD 

mua\ 



ID 


mmmmm 



14 

a 2 


r D 1 R +1 



15 

bj 


Mai 

1H 

■ 


16 


i -1 

mm 

II 1 



17 

b 3 

Ini 

BIB 

EQSlm 

|^wpi«wa| 


18 


l ST0 1 

ZDDD 

RCL 11 + 1 



19 


1 e>< 1 

ZDDD 

«II • 1 



20 


| 0 ] 


mm 



21 


1 c ° s l 

HFI 

_JI_1 

u 


22 


lx5± V 1 

mim 

x Jl . 1 

V 



Logarithm of a complex number to a complex base 


Formula: 


logtai+ibj) (a 2 + *2) = 


In (a 2 + ib 2 ) 
In (ai + ibi) 


351 


































a t + ibj # 0 


Example: log( j+j) (1.49 + 4.126i) - 2 - i 


(a 3 = .34657359, b 3 = .7853981633) 






































































Complex Trigonometric and Hyperbolic 
Functions 

In this section all angles in the equations are in radians. 

Complex sine 

Formula: 

sin (a + ib) = sin a cosh b + i cos a sinh b 
= u + iv 


Example: 

sin (2 + 3i) = 9.154 -4.1689i 


UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b 

□ □□□□ 



2 

a 

uiniaioiai 



3 


aiBiaaiBaKsii 



4 


UiniBBliBHai 



5 


mranniiBiiBai 



6 


QIEB9I^9ISS1MI 

u 


7 


BsuiesiBira 



8 


rnmmmm 



9 



V 



Complex cosine 

Formula: 

cos (a + ib) = cos a cosh b - i sin a sinh b 
= u + iv 


Example: 


cos (2 + ei) = -4.189 - 9.l09i 
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UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b 

□ □□□□ 



2 

a 

rnmsmQ 



3 


! ir | (”+ | [ST0"| | cos ) | x^y | 



4 


mm [id siz] 



5 


mmmmm 



6 


CD CD ED CD CD 

U 


7 


fCLXl 1 + 1 r*i>n U^y | | e* | 



8 


mmmmm 



9 


m m m m □ 

V 



Complex tangent 

Formula: 


tan (a + ib) = 


sin 2a + i sinh 2b 
cos 2a + cosh 2b 


= u + iv 


Example: 


tan (2 + 3i) = -.00376 +1.003i 


UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b 

mmmmm 



2 

a 

CD CD CD CD CD 



3 


H0000 



4 


CD CD ED CD CD 



5 


CD CD CD ED ED 



6 


mmmmm 



7 


CD ED [x£yj [STOJ [cLx] 



8 


mmmmim 



9 


CD CD CD ED rn 



10 


ran ii a in. ii« i 



11 


1 » ll * ll,iniRCLll ♦ 1 

u 


12 


mmmmm 

V 1 
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Complex cotangent 

Formula: 


cot (a + ib) = 


sin 2a - i sinh 2b 
cosh 2a - cos 2b 


= u + iv 


Example: 


cot (2 + 3i) = -.0037 -.9968i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b 

£□□□□□ 



2 

a 

InlalnlsBlal 



3 


miDiomiBii 



4 





5 


| t | ( t 1 1 FH j \TTi ! e* ] 



6 


f 

1 

B 



7 


jx^vll - 1 | RCL [ | x«^y | f STO ] 



8 


iBQimisniaii^l 



9 


iraiSSlDIBIEBSI 





IBIBIBIBIOI 



D 

■ 

iDiaimmcai 



12 


lEBilBMIBIBI 

u 


13 


SCIOtZO 

V 



Complex cosecant 


Formula: 


esc (a + ib) = 


1 

sin (a + ib) 


= u + iv 


Example: 


esc (2 + 3i) = .09 +.0412i 
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Complex arc sine 

Formula: 


arc sin (a + ib) = arc sin 0 + i sgn(b) In (a + Va 2 - 1 ) 
= u + iv 

where: a = i V(a + l) 2 + b 2 + i V(a - l) 2 + b 2 
0 = | V(a + l) 2 + b 2 - i V(a - l) 2 + b 2 


sgn (b) = 
Example: 


1 if b >0 
-1 if b < 0 . 


arc sin (5 + 8i) = .556 + 2.9387i 

Note: Inverse trigonometric and inverse hyperbolic functions are multiple¬ 

valued functions, but only one answer will be given for each of them. 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

a 

1 t 11_11_11_| l_| 



2 

b 

mnnii^iram 



3 


mrnnmm 



4 


Nmnmm 



5 


ramr^nrriHH 



6 


Ml"* II ♦ llv^ll 2 ] 



7 


1 + II t || t IIrclII * I 



8 


|x5ty 1 |RCL| 1 - 1 1 arc 1 i sin 1 



9 


1 . II X || 1 || 8 II 0 1 



10 


[!]□□□□ 

U 


11 


cm mm mm 



12 


mrarnrrnrn 


If b> 0,90 to 14 



□□□□□ 



13 





14 


□□□□□ 

V 
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Complex arc cosine 

Formula: 


arc cos (a + ib) = arc cos 0 - i sgn (b) In (a + Va 2 - 1 ) 
= u + iv 

where: a = i V(a + l) 2 + lb 2 + ~ \/(a - l) 2 + b 2 

fi = i V(a + l) 2 + 1b 2 - i V(a- l) 2 +b 2 


sgn (b) = 


1 if b >0 
-1 if b < 0 


Example: 


arc cos (5 + 8i) = 1.0147 - 2.93 87i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

a 

□□□□□□ 



2 

b 

□H □□ EG 



3 


iiBiuinwKliefl 



4 


IEE3IDIHIDIKS! 



5 





6 


IBIBlfilBIOI 



7 





8 


insiicsimciaieii 



9 


□□ [Z1 □□ HE) □□ 



10 


IBMUMUI 

U 


11 


| Rl 1 I t 1 [T] | x 1 1 1 1 



12 




If b < 0, go to 14 



EMlHRlMlHIIHI 





ISalBlMlBlBl 



14 



V 


























Complex arc tangent 

Formula: 

arc tan (a + ib) = -i |jr - arc tan 
= u + iv 

where: (a + ib) 2 ¥= - 1. 

Example: 

arc tan (5 + 8i) = 1.5142 + .0898i 




























Complex arc cotangent 

Formula: 


Example: 


arc cot (a + ib) = — - arc tan (a + ib) = u + iv 


arc cot (5 + 8i) = .0566 - .0898i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b 

mmEmn 



2 

a 

IstoI 1 * 1 CD1 tan 1 l x D 



3 





4 





5 


cd i~rti cd s [clx] 



6 


CD CD CD CD CD 



7 


rn cd cd rn i i 

u 


8 


CD CD CD CD CD 



9 


rpr^D cd cd cd 



10 


□ E][T][71@ 



11 


CD CD CD CD CD 



12 


[chs] f 1 CD CD 1 1 

V 
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Complex arc cosecant 

Formula: 


arc esc (a + ib) = arc sin 



= u + iv 


Example: 


arc esc (5 + 8i) = .05598 - .0899i 
(D < 0) 


UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b 

[!]□□□□ 



2 

a 

| t || t || xJlSTOJlCLX] 



3 


mrsrinnrarn 



4 


|STO | I * | |CHS | |x5iy | ] RCL| 



5 


minima 

D 


6 





7 


mmmmm 



8 


rarnmrnnn 



9 


[rcl! rn r^n m nn 



10 


mmitksco 



11 


mmrnfiHnnn 



12 


ratinu ii i 



13 


| * | | X 1 I 1 I I 8 I I 0 I 



14 


Fif 11 11 11 1 

u 


15 


mmmrnrn 



16 


□ [^1 d] □□ □□ 


If D>0 / goto 18 



1 II II II II 1 



17 


iHssimmmi i 



18 


i_ii_ii_ii_ii_i 

V 
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Complex arc secant 


Formula: 


arc sec (a + ib) = arc cos 



= u + iv 


Example: 


arc sec (5 + 8i) = 1.5148 + .0899i 
(D < 0) 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

b 

[!]□□□□ 



2 

a 




3 





4 


fSTO~l rn |CHS~| [^71 fRCL] 



5 


niEEiiziinQ 

D 


6 


mnnpnPiTim 



7 


| t 1 [ 1 | | + 1 | t | | X | 



8 


EE] □□ EH □□ EH 



9 


iRn rn rn ni rn 



10 


istoi|ri n nivsjLiJ 



11 


m m ej 



12 


l**V 1 hCLl n 1 arc I [cos | 



13 


| * || x || 1 || 8 || 0 | 



14 



u 


15 


| Ri || T ||t II X 111 I 



16 




If D<0, goto 18 



□□□on 



17 


Ichs 11 ii || n i 



18 


i n M ii ii i 

V 
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Complex hyperbolic sine 

Formula: 

sinh (a + ib) = -i sin i (a + ib) 
= u + iv 


Example: 


sinh (3 - 2i) = -4.1689 - 9.154i 




OPERATIONS 

DISPLAY 

REMARKS 

1 

a 

mnana 



2 

b 

ICHSlI t || 1 1| 8 || 0 ) 



3 


iDinlmsam 



4 


lEainii— imua 



5 


tzniniSELjj 



6 


PH m [RcUl f^Tl fSTOl 



7 


fCLX] | + | | co* 1 PP f e* | 



8 


mmnmm 



9 


[T]l5£t][£Hs][^]a 

U 


10 



V 



Complex hyperbolic cosine 

Formula: 

cosh (a + ib)= cos i (a + ib) 
= u + iv 


Example: 


cosh (1 + 2i) = -.6421 + 1.0686i 
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LINE 

DATA 

OPERATIONS 


REMARKS 

1 

a 

1 t 11_| |_J |_11_j 



2 

b 




3 





4 


|»*y|[ t ll t IfRt |[R< | 



5 

__ . j 

000Q0 1 



6 

1 

1 + ~ll« |[ncq|xg»||sT0| 

U 


7 


|CLX|| ♦ || sin ||x^y|| e« | 



8 


| t | | % | | - | | 2 11 + j 



9 


RNm ii i 

V 



Complex hyperbolic tangent 

Formula: 

tanh (a + ib) = - i tan i (a + ib) 
= u + iv 


Example: 


tanh (1 + 2i) = 1.1667 - .243i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

a 

mnnnn 



2 

b 

Nmmmw 



3 


rnmmmri 



4 


[jrJ UJ [cos] [x^yj |_2j 



5 


nnrnrrirsrim 



6 





7 





8 


|£Lx)l_LJL?yll*' ILLJ 



9 


mnmmw 



10 


1 t ll^vll HI 8 || 0 | 



11 


EEH □□ CH \*E\ f*£Ll 



12 


ii i 

U 


13 


rannni i 

V 
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Complex hyperbolic cotangent 

Formula: 

coth (a + ib) « i cot i (a + ib) 
* u + iv 

Example: 

coth (1 + 2i) - .8213 + . 17138i 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

a 

□□□□on 



2 

b 

eimcnmeg 



3 


□□□□□ 



4 


L' 2 J 



5 


mmmrsnm 



6 


Rimmnnn 



7 


□ □□SES 



8 


UsDSOOO 



0 


1 »II‘A ll - II Ml * 1 



10 


|RCL|| + I (CMS 11■«¥11 1 | 



11 


1 Ml « ll « II * 11 ♦ 1 



12 


fain j|RCL|| + ||«*Y||CHS| 

U 


13 


□ □□□□ 

V 



Complex hyperbolic cosecant 

Formula: 


csch (a + ib) = i esc i (a + ib) 
- u + iv 


Example: 


csch (1 + 2i) = -.2215 - .63549i 
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DATA 

OPERATIONS 

DISPLAY 


i 

a 

rarararara 



2 

b 

[CHS|| t || 1~|| 8 || 0 | 



3 





4 


nn i~ti r^i r^i 



5 


smmum] 



6 


1 HI * | | RCl] | x?^y | |STO | 



7 


[CLxj [""+ j [coir] |xity 11 e* | 



8 


mm EH [El CD 



9 


nnfFcnmnnn^i 

| 


10 


E JL*J[ «D!?cl| | r. | 



11 





12 


[:3dt2P L ir till ll_1 



13 


irararirara 




Complex hyperbolic secant 


Formula: 


Example: 


sech (a + ib) = sec i (a + ib) 
= u + iv 


sech (1 f 2i) = -.4131 - .6875i 


UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

a 

rarararara 



2 

b 

i^hsi rn n~i rn rn 



3 





4 


rararnrara 



5 


rarararara 



6 


ra ra ra ra ra 



7 


f^irnRn^vinn 



8 


rarararara 



9 


1 x I [CHS | [RC^lJ | t j | x | 



10 


(S3 11 M hi x hrcl] 



11 


| Ri | |xEy | [ Rl [ | + | |STO | 



12 


rarararara 

U 




BB0JB1 1 

V 
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Complex inverse hyperbolic sine 

Formula: 

sinh -1 (a + ib) = -i arc sin i (a + ib) 
= u + iv 


Example: 


sinh" 1 (8 - 5i) = 2.9387 - .556i 





cosh" 1 (5 + 8i) = 2.9387 + 1.0147i 





















QKBI 

i 

a 

2 

b 

3 


J J 


5 


6 


7 


8 


9 


10 


11 


12 




13 


14 


15 



OPERATIONS 



Complex inverse hyperbolic tangent 

Formula: 


Example: 


tanh 1 (a + ib) = -i arc tan i (a + ib) 
= u + iv 

tanh' 1 (8 - 5i) = .0898 - 1.51421 


















Complex inverse hyperbolic cotangent 

Formula: 

cotJT 1 (a + ib) = i arc cot i (a + ib) 


- u + iv 

Example: 

coth" 1 (8 - 5i) = .0898 + .0566i 




















Complex inverse hyperbolic cosecant 

Formula: 


csch* 1 (a + ib) = i arc esc i (a + ib) 
= u + iv 


Example: 


csch -1 (8 - 5i) = .0899 + .055981 
(D < 0) 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

a 

mna 

□□ 



2 

b 

|CHS|| t || t || , IISTOj 



3 


icpFinDmED 



4 


rn [sroi rn iwsi 



5 


mm mi ii i 

D 


6 


mnnisToiF57irn 



7 


CD CD CD 

CD CD 



8 


mi m m m i * i 



9 


Smmmm 



10 


fsioi m rn m m 



11 


LBIMliglEBIMI 



m 


1 X *^Y I |RCL | m Urc 1 f^ n 1 



13 


m i * 11m 

CD CD 



14 


FiMr^nrnrn 



15 


cdcdqgdid] 



16 


□□□CD 

1 II 1 


If D <0,9O to 18. 



□□□ 

1 II 1 



17 


[chTII II 

mm 



m 


□□□ 

i H i 

u 


19 


SUDD] 

i_ ii _i 

V 



370 


























Complex inverse hyperbolic cosecant 

Formula: 

csch -1 (a + ib) = i arc esc i (a + ib) 
= u + iv 


Example: 


csch -1 (8 - 5i) = .0899 + .055981 
(D < 0) 




OPERATIONS 

DISPLAY 

REMARKS 

1 n 

a 

□□□□□□ 



2 

b 

lews II t || t ll X | [STO | 



3 


I.CLX | | + | | FU || x | [Ret] 



4 


| + I |sTO | | + ||CHS||x*y| 



5 



D 


6 


r t~i i«i |sto i igjj i > i 



7 


mmrnmm 



8 


^imnmm 



9 


irclii* iuk 2 ii n 



10 





11 


1 HI ' II t UrclII * | 



12 


(x3ty | jRCL | | - 11 arc | | sin | 



13 


nnnnmrinnr] 



14 


mii^iririmm 



15 


□□□Em 



16 


[ In | | | | | | | | 1 


If D <0, goto 18. 



□ □□□□ 



17 


|CHS|( || |f || || 



18 


□ □□□□! 

u 


19 

i- 


V 



Compound Interest 

See page 96 
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Hyperbolic and Inverse Hyperbolic Functions 


Register usage for the following functions: 

(Same as for keyboard trigonometric functions.) 



x-^-X 
f(x) ►lost 

8--S 


Gudermannian function 

Formula: 

gd x = 2 arc tan e x - ~ 


Example: 


gd 0.345 = 19.386 

Note: - = 90° 

2 


UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

X 

1 e " 1 ! •« | f=n | 2 | pH 



2 


mrrnrnrnrn 




Hyperbolic sine 

Formula: 

sinh x = —r— 


Example: 


sinh 3.2 = 12.25 


UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

X 

Ecu [Ei mm 



2 


mmmmm 
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Hyperbolic cosine 

Formula: 


cosh x = 


e + e" 


Example: 


cosh 3.2= 12.29 


2 


OPERATIONS 

DISPLAY 

REMARKS 

1 

X 




2 


ranacnn 




Hyperbolic tangent 

Formula: 

, sinh x , 

tanh x =-= sin gd x 

cosh x 


Example: 


tanh 3.2 = .99668 


UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

D 

X 

i.nr^ii»"ii » i 



D 

’ 

rnmn^i i 




Hyperbolic cotangent 

Formula: 

coth x = —r— 
tanh x 


Example: 


coth 3.2= 1.003 


Q2H 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

i 

X 

1 2 ll X 1 



2 


mmnmm 
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Hyperbolic cosecant 

Formula: 


csch x = —— 
sinh x 


Example: 

csch 3.2 =.08166 



DATA 

OPERATIONS 

DISPLAY 

REMARKS 

n 


HEDSQtH 



a 

i 

mmi n ii i 




Hyperbolic secant 

Formula: 

sech x = —]— 
cosh x 


Example: 


sech 3.2 = .081 


UNE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

X 

mmmmm 



2 






Inverse Gudermannian function 

Formula: 


Example: 


gd-‘0 = lntan^ + !^ 


gd' 1 30° = .549 


Note: - = 45° 

4 


1^3 


OPERATIONS 

DISPLAY 

REMARKS 

i 

e 

mmmmm 



2 


rnHHi ii i 
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Inverse hyperbolic sine 

Formula: 


sinh 


-l 


; = In [x + (x 2 + l) 2 ] 
= gd" 1 (tan -1 x) 


Example: 


sinh' 1 51.777 = 4.64 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

X 

IjED ged mu □□ mu 



2 

1 

| 5 [ [ + | [ tan | | In | |_| 




Inverse hyperbolic tangent 

Formula: 

tanh -1 x = — In — + — ■■ = gd -1 (sin -1 x) 
2 1 -x 


Example: 


tanh -1 0.777 = 1.038 




OPERATIONS 

DISPLAY 

REMARKS 

1 

2 

X 

|arc || sin || 2 || * || 4 | 




| 5 | | + | | tan 11 in | | | 




Inverse hyperbolic secant 

Formula: r 

sech -1 x = In i + (~ 
x \x 2 



= gd -1 (cos 1 x) 


Example: 


sech' 1 0.777 =.74 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

X 

| arc | | cos | \~2 | p? | p4 | 



2 


000BD 
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Inverse hyperbolic cosine 

Formula: 

cosh -1 x = sech -1 — 
x 


Example: 

cosh" 1 51.777 = 4.64 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

X 

HHStnCI] 



2 


□□ CD □□ H DD 




Inverse hyperbolic cotangent 

Formula: 

coth" 1 x = tanh -1 — 
x 


Example: 


coth" 1 51.777 = .0193. 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

X 

cm h cm cm cm 



2 

i_ 

miBIMlEailEBi 




Inverse hyperbolic cosecant 

Formula: 

csch" 1 x = sinh" 1 — 
x 


Example: 


csch" 1 0.777= 1.0705 


LINE 

DATA 

OPERATIONS 

DISPLAY 

REMARKS 

1 

X 

iKaimicaimm 



2 i 

_i 


miBiiBiBaioi 
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INDEX 


Absolute difference, 5 
error, 87 

Adams' formulas, corrector, 

172 

predictor, 172 

Algebraic data entry methods, 

7 

language, 7, 19 
Aliasing, 144, 150 
Alpha-numerics, 331 
Analytic substitution, 211 
Approximate values, 88 
Automatic landing, 64 

Bessel functions, 129 
interpolation formula, 57, 59 
spherical functions, 131 
Biased estimate, 250 
Binomial series, 91 
Bisection method, 226 
Bode's definite integral for¬ 
mulas, 16 3 

Boolean equations, 14 

Calculator definition, 14 
mechanical, 54 

Central tendency, measure of, 
245 

Chebyshev polynomials, 134, 199 
numerical evaluation of, 210 
Chi-square testing, 268 
Coefficient of excess, 253 
of variation, 251 
Comer functions, 39 
Complex functions, 108, 336 
variables, 108 
zeros, 232 

Computing reciprocals, 331 
Confluent hypergeometric func¬ 
tion, 134 

Cosine integrals, 116 
Courant's penaly function meth¬ 
od, 308 

Curve-fit polynomial, 34 


Data entry methods, 7 
error location and correction, 
67 

extrapolation, 63, 67 
interpolation, 57 
tables, missing entries, 70 
Decrement and jump on zero in¬ 
struction, 287 

Definite integral formula, 154 
integration, 154 
Delete instruction, 288 
Difference, absolute, 5 
forward, 54 
percentage, 5 
tables, divided, 74 
Dispersion, measures of, 249 
relative, 251 
Distribution mode, 247 
shape, measures of, 251 
Divided difference tables, 74 

Equally spaced data, 54 
Error, absolute, 87 
correction, 67 
of data, 89 
of calculation, 89 
estimate, 170 
location, 67 
relative, 43 
roundoff, 40 

trapezoidal integration, 157 
Euler constant, 122 
integral, 122 
integration formulas, 181 
-Maclaurin summation formula, 
95 

predictor formula, 171 
Exponential integrals, 116 

False-position method, 228 
Firmware, 287 
Fixed-point numbers, 4 
Floating-point numbers, 4, 39 
Forward difference, 54 
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Index 


Four-function calculator, 10 
Fourier analysis, 139 
coefficients, 144 
series of continuous func¬ 
tions, 139, 142 
discrete functions, 141, 142 
Frequency distributions, 243 
Fresnel integrals, 125 
Function evaluation, 53 
accuracy in, 38 
Bessel, 129 
Coiner, 39 

confluent hypergeometric, 134 
curve-fit, 34 
error, 126 

Fourier coefficients, 144 
continuous, 142 
discrete, 141, 142 
gamma, 122 

nested parenthetical, 31 
polynomials, 134 
roots, 224 

Gamma function evaluation, 122 
Gauss interpolation formula, 

57 

multiplication formula, 123 
Go-to instruction, 287 
Gradient method, 307 

Hamming midpoint integration 
formula, 159 

12-point formula for Fourier 
analysis, 145 
Hermite polynomials, 134 
Hierarchy, operands, 20 
operators, 8 

Indefinite integral formula, 
154 

numerical integration, 164 
Infinite series, 90 
truncated forms, 92 
Instruction methods, 7 
set, 10 

Instructions, 153, 154 
Integrals, exponential, sine 
and cosine, 116 
Integration, Bode's definite 
integral formulas, 163, 164 


definite, 154 
Euler formula, 166 
-Maclaurin formula, 163 
modified, 167 
predictor, 171 
indefinite, 154, 169 
midpoint, 159 

Newton-Cotes' definite, 165 
open, 163 

predict-correct, 168 
modifying, 170 
Simpson's rule, 162 
Runge-Kutta, 172 
T-, 178 

trapezoidal, 155, 182 
error, 157 

Interpolation formulas, Bessel 
57, 59 
data, 57 
Gauss, 57 
inverse, 62, 75 
Lagrange, 62, 72 
linear, 60 
Newton, 75 
Stirling, 57 

Inverse interpolation, 62, 75 

Jump instruction, 287 

Kummer's transformation, 95 
Kurtosis formula, 252 

Lagrange 5-point interpolation 
formula, 62 

interpolation formula, 72 
Laguerre polynomials, 134 
Lanczos method, 204 
Languages, algebraic, 7, 19 
polish, 7 

reverse-polish, 7, 19 
Linear interpolation formulas, 
60 

systems simulation, 180 
Logical flag instruction, 287 
Lozenge diagram, Kukasiewic's 
algorithm, 58, 326 

Matrix analysis, 233 
Maxima and minima, 301 
Mean, arithmetic, 83, 245 



Index 
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A 

Si 

i 


deviation, 250 
generalized 83 
geometric, 33, 248 
harmonic, 248 

value theoiim, 46 
Memories, 9 
Memory stacl, 7 , 9 
Midpoint in%gration, 159 
Milne’s preset-correct a.’ 

gorithms,175 x tabl 

Missing entres in data t “ 

70 

^of dis>ibution, 24'| 
ms? indefinite 

• ot %thod, 16 / 


es, 



in- 


Nested parer^tical fr / orras ' 31 ' 

90 r 

Newton backwd differ f ence 
formula, . 59 


forward difi- ence ^(..ormula, 
57, 59 f 

interpolati f or mul * a based 
on divide^iff ere Bnces, 75 
method of cU.rminirl / 15 roots, 
29, 229 

-Raphson. fo^ia, .31 
288 


function 



Optimization ,oq 
C ourant's P 6 lty 
method, 3C 
gradient me 13 307 

parameter w^.^^Constraints, 
304 . , a 

without constraints, 30 4 
Overflow, 4 , 39 


Pade's approximation 213 
Pearson's skewness formula, 
I' 252 

Percentage difference, 5 
Pocket calculator, 1 
four-function, 10 
programmable, 283 
Polish data entry method, 7 
Probability, 242, 253 
distribution, 255 
binomial, 255 
Gaussian, 242, 255, 258 


Poisson, 258 

Programmable pocket calculator, 
283 

Progressions, 81 

Rayleigh's generating formula, 
133 

Reciprocals, computing, 331 
Relative dispersion, 251 
error, 43, 87 

Recurrence formula, 199, 210 
Reverse-polish data entry meth¬ 
od, 7 

language, 7, 19 
Root evaluation, mean square, 
249 

Newton’s method, 29, 229 
Roots of a function, 224 
Roundoff error, 40 
Runge-Kutta methods, 172 
Run/stop instruction, 288 

Sampling, 260 
in the small, 265 
Sine integrals, 116 
Single-step function, 288 
Skewness", 252 
Software, 288 

Solutions of cubic equations, 

97 

plane triangles, 105 
polynomials, 96 
quadratic equations, 96 
spherical triangles , 105 
Stable difference equations, 

189 

Starting values, 169 
Statistical estimation, 263 
Statistics, 242 
Stirling's interpolation for¬ 
mula, 57 

Stop instruction, 288 
Successive approximation meth¬ 
ods , 100 

Tail effects, 152 
Taylor's single variable for¬ 
mulas, 90 

Temporary stop instruction, 

288 
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Index 



Test flag instruction, 287 
T-integration, 178 
13-place precision, 75 
Transcendental functions, 102 
Transformation of series, 95 
Trapezoidal error, 157 
formula, 182 
integration, 155 
Triangles, solutions of plane 
105 

spherical, 105 


Tricl cs, 323 

Trun< 2a f - Ln g a numbe"/ 326 


rflow, 39 


ar. propagatin, 194 


correction ac>or, 


Zero; 
of I 


3, complex, 23, 
polynomials, 2$ 
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SCIENTIFIC ANALYSIS 
ON THE 

POCKET CALCULATOR 

By Jon M. Smith 

Your pocket calculator can be an 
amazingly powerful instrument, and 
can even perform calculation? that 
are usually left to large computers. 
This is the first and only book that 
shows you how to get the most out 
of your pocket calculator for scien¬ 
tific and engineering applications— 
you’ll discover more uses than you 
ever thought possible! 

Ideal for all forms of analysis, this 
book gives you numerical tech¬ 
niques, approximations, tables, 
graphs, and flow charts for perform¬ 
ing quick, accurate calculations. The 
fast, efficient methods you’ll learn 
are “sized” for pocket calculator 
analysis and written in forms that 
minimize your workload (thus reduc¬ 
ing your chance for error). The book 
emphasizes methods for such spe¬ 
cific types of data processing as 
Fourier analysis, linear system simu¬ 
lation, gradient optimization, and 
statistical analysis, presented in 
forms that are directly applicable to 
the needs of engineers, scientists, 
and scientific programmers. 

The discussion of each numerical 
method includes: 

• a consistent and careful treatment 
of its use in numerical analysis 

• a detailed discussion of how each 
method is used in pocket calcula¬ 
tor work 

• a description of each method’s li¬ 
mitations and advantages (where 
applicable) 

• a tabulation of useful formulas in 
convenient forms for pocket calcu¬ 
lator analysis 

This book will show you how your 
pocket calculator can— 

• become a “Learning Machine” 

• help you learn new technologies 

• be used for your own research and 
discovery in the field of numerical 
analysis 


About the Author 

JON M. SMITH is a scientist/engi¬ 
neer who, since 1971, has been 
President of Software Research Cor¬ 
poration of St. Louis, Missouri. He 
has published in the fields of physics, 
biomedical engineering, aerospace 
engineering, and mathematics, and 
has given many lectures and semi¬ 
nars on Modern Numerical Methods 
at government, industrial, and edu¬ 
cational institutions. 


